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From our roots, we grow 


ZOOLOGICAL RESEARCH 


Celebrating the 60" anniversary of the Kunming Institute of Zoology, 


Chinese Academy of Sciences 


Located at the head of the Indo-Burma biodiversity hotspot, 
the Kunming Institute of Zoology (KIZ), Chinese Academy of 
Sciences (CAS), serves as China's main center for research 
into the diverse animal and ecological resources of 
southwestern China, Eastern Himalayas, and Southeast Asia. 
As of October 2019, it has been 60 years since the inception 
of KIZ. Since 1959, strong roots have been laid down by 
generations of researchers, allowing KIZ to grow and evolve 
into a comprehensive research institution renowned for its 
remarkable achievements in evolutionary mechanisms of 
animal biodiversity, animal resources protection, and 
sustainable utilization. It is now recognized as "a major 
powerhouse in evolutionary biology research in China" and is 
"establishing itself in the world stage" (Overseas Experts 
Review Committee, organized by the Bureau of Development 
Planning, CAS, during international evaluation in 2014). 

To celebrate the 60th anniversary of KIZ and the 70th 
anniversary of CAS, Zoological Research presents this 
commemorative issue, composed primarily of contributions 
from KIZ researchers. In addition, it is our great honor to 
provide here a brief retrospective of the pioneering work 
undertaken by the earlier scientists at KIZ and recent 
achievements, which will hopefully serve to motivate and 
inspire present and future successors. 

During its first two decades (1960s—1970s), KIZ carried out 
various comprehensive field surveys, including the "Joint 
Survey of Western Yunnan Province", "Sino-Vietnamese Joint 
Exploration of Fishery Resources of the Honghe Area", 
"Animal Resources Exploration of Southwestern China", "Field 
Survey of the Mountain Gaoligong" (which was also recorded 
as the first scientific documentary film of China "Wild Animals 
of Yunnan Province"), "Field Survey of the Bilou Snow 
Mountain", and "Comprehensive Exploration of the Hengduan 
Mountain". These fundamental works formed the roots from 
which subsequent research have thrived. In these surveys, 
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many animal specimens were collected, and a substantial 
amount of original data were accumulated, allowing for the 
advancement of studies in both classical zoological disciplines 
(e. g., new species identification and vertebrate fauna 
classification, comparative anatomy, and animal physiology) 
and thereafter cutting-edge subjects (e.g. de novo gene 
origin, genome evolution and adaptation, and genomic 
diversity of animals). Many remarkable breakthroughs were 
achieved during these years, including the compilation of an 
extensive vertebrate zoography, establishment of the first 
captive-bred macaque population in Menglun, 
Xishuangbanna, and a comprehensive animal field survey in 
western Sichuan Province, which was reported in KIZ's first 
foreign language academic paper published in Nature by Prof. 
Hung-Shou Pen (3 28 #2) (Pen, 1962). Sadly, Prof. Pen lost 
his life during a field survey in August 1981 due to high- 
altitude pulmonary edema, which was not only a significant 
loss to KIZ but to the scientific world at large. In 1978, KIZ 
won the "CAS Award for Major Science and Technology 
Achievements" for its timely research on the "Biological Effect 
of Nuclear Weapons". Furthermore, during many difficult field 
surveys, KIZ researchers successfully collected abundant 
Specimens, resulting in the considerable expansion of the 
Kunming Natural History Museum of Zoology, which now 
holds one of the richest tropical and sub-tropical animal 
collections in China (with over 800 000 specimens to date). 
Over the following two decades (1980s-1990s) KIZ 
emphasized and strengthened its research directions and 
increasingly developed its international cooperation. It 
established key research groups focusing on animal ecology, 
animal pathology, experimental animals, animal toxinology, 
and animal cytogenetics. Many significant innovations and 
achievements were made during this period, with wide 
international impact. For instance, Shi and coworkers 
provided strong support for the tandem chromosome fusion 
theory of Indian muntjac karyotype evolution (Shi et al., 1980), 
as mentioned by the Nobel laureate Dr. Barbara McClintock in 
her acceptance speech (McClintock, 1983). Ma et al. (1990) 
identified a new mammalian species in the genus Muntiacus 
using morphological analysis and karyotype structure 
comparison. Professor Yu-Liang Xiong and other KIZ 
coworkers advanced the treatment of snakebites with the 
invention of an emergency kit for snake envenomation (Lu et 
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al., 2010). KIZ researchers also published many books based 
on non-human primate studies, constituting the first 
monographs in the field at that time. 

In 1986, KIZ established the first wild animal cell bank of 
China under the leadership of the late Prof. Li-Ming Shi, which 
featured cell lines of endangered species and is currently 
listed as an affiliate of the Committee on Type Culture 
Collection of CAS. These rich genetic resources are valuable 
components of the animal division of the Germplasm Bank of 
Wild Species, and have received wide attention (Ryder et al., 
2000). KIZ researchers also established lymphoblastoid cell 
lines for ethnic minorities. In 1988, in cooperation with the 
Primate Center of Yunnan Province, KIZ established the "Joint 
Laboratory of Primatology", which progressed into the "China- 
US Primate Biology Laboratory" in 1996 after collaboration 
with the University of Wisconsin, USA. The Laboratory of 
Cellular and Molecular Evolution, established by Prof. Li-Ming 
Shi in 1990, was among the first laboratories to use molecular 
techniques to study animal diversity and evolution in China. 
This laboratory evolved into the "State Key Laboratory of 
Genetic Resources and Evolution" in 2007 and now serves as 
one of the core units of KIZ. These platforms and key 
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laboratories have laid a solid foundation for current 
development and future planning and have strongly influenced 
the dominant research areas of KIZ. 

In the new millennium (2000-2019), after being selected as 
a pioneer institute to undertake the "Knowledge Innovation 
Project of CAS", KIZ realigned its research focus, which has 
proven to be highly successful. With the recent "One-Three- 
Five" strategic plan, KIZ has further strengthened and 
established itself as a premier research institute within CAS, 
concentrating on regional animal biodiversity, evolution, and 
preservation (Figure 1). At present, KIZ is focused on 
accomplishing three major breakthroughs: i. e., developing 
non-human primate models for disease and pharmaceutical 
research, achieving theoretical discoveries in evolution and 
genomics, and cultivating genetic resources of domestic 
animals and their wild counterparts. Based on the "One-Three- 
Five" strategic plan, KIZ has incorporated its traditional 
strengths to adapt to new trends in frontier and 
interdisciplinary sciences. In addition, KIZ has emphasized 
innovation with profound implications for public education, 
health, conservation, policy recommendations, and scientifically 
sound management of diverse biological resources. 
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Figure 1 Dominant research areas (A) and "one body two wings" strategic planning (B) of the Kunming Institute of Zoology (KIZ), 


Chinese Academy of Sciences 
Small figure sections were taken from the web or were provided by KIZ. 


During the "One-Three-Five" strategic plan, both research 
output and scientific accomplishments have flourished. For 
instance, KIZ was granted the State Natural Science Award 
and State Technological Invention Award of China for various 
achievements, such as the "Mitochondrial genome diversity 
and population history of East Asian", "Mechanism of origin 
and genetic evolution of newly evolved genes", "Genome 
diversity and evolution of Asia populations", and "Directional 
mining technology system of medical functional components 
from poisonous animals based on organism's surviving 
strategy". 

KIZ's research on non-human primates has greatly 
advanced the development of animal models of human 
diseases resistant to existing treatment options (such as HIV/ 
AIDS, Alzheimer's disease, Parkinson's disease, and chronic 
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social defeat depression) and has helped to elucidate the 
underpinnings of these complex diseases. Moreover, the 
accumulation of knowledge on the reproductive biology of non- 
human primates has had important implications for human 
infertility and genetic manipulation of monkeys. For instance, 
Zheng et al. (2001) created the first serum-free culture 
medium for in vitro-matured rhesus monkey oocytes, which 
proved to be crucial for evaluating the effects of in vitro 
maturation on developmental capacity (Zheng, 2007). Niu et 
al. (2010) developed the first transgenic monkey in China. 
Most recently, KIZ researchers and collaborators have 
attracted world-wide attention for creating transgenic monkeys 
carrying human MCPH1 gene copies and showing human-like 
neoteny of brain development, which is the first transgenic 
monkey model used to study human brain evolution (Shi et 


al., 2019; Shi & Su, 2019). KIZ has also investigated the use 
of the Chinese tree shrew as an alternative to non-human 
primates for animal models of human diseases and has made 
remarkable progress, e.g., determination of the Chinese tree 
shrew genome (Fan et al., 2013, 2019) and creation of the 
first transgenic Chinese tree shrew (Li et al., 2017). The 
availability of genetic modification for monkeys and tree 
shrews will undoubtedly continue to facilitate animal model 
studies in KIZ. Furthermore, the rich primate resources and 
studies of KIZ have also attracted global attention (Cyranoski, 
2004; Hao, 2007). In recent years, to match the urgent need 
and rapid development of studies on non-human primate 
animal models and human disease mechanisms, KIZ 
commenced construction of the National Research Facility for 
Phenotype and Genotype of Model Animals, which is one of 
the "12th Five-Year" Major National Technological 
Infrastructure Projects. The construction of this mega science 
facility (MPF: model primate facility) will provide a cutting-edge 
platform for studying genotype-phenotype relationships using 
non-human primates and will boost original findings and 
international collaborations. The construction of this MPF was 
also featured in Nature news (Cyranoski, 2016) and will 
hopefully promote continued research using primate 
resources. 

KIZ has a long tradition for studies on animal toxins and on 
active peptides and proteins derived from natural biodiversity 
resources, such as those found in insects, centipedes, and 
venomous animals, with promising results for the development 
of novel pharmaceuticals and pain killers (Luo et al., 2018; Xu 
& Lai, 2015; Zhang, 2015). Relying on these top-tier 
resources, platforms, and talents, KIZ has been recognized 
for its achievements in vaccine and medicine development for 
HIV-1 infection, bacterial infection, and neuropsychiatric 
disorders. Several new drugs for Alzheimer's disease, 
depression, AIDS, vaginosis, and pain relief have been 
successfully developed by KIZ teams and collaborators. 
These drugs are currently on the market (e. g., analgesic 
treatment Ketongning, Tramadol Hydrochloride and Ibuprofen 
Tablets) or in clinical trial (e. g., antidepressant Orcinocide 
(phase Il of clinical trials) and anti-Alzheimer's treatment 
Phenchlobenpyrrone (phase II of clinical trials)). 

Yunnan and its surrounding regions boast some of the 
world's richest biodiversity, both in regard to wild and domestic 
animals. These areas provide excellent models for 
understanding evolution and adaptation to diverse 
environments and the reasons for high diversity. Research on 
domestic animals and wild counterparts covers a wide range 
of scientific questions, from understanding the genetic 
mechanisms associated with domestication processes to 
biodiversity monitoring and restoration. KIZ researchers have 
made remarkable achievements in these fields, e. g., 
generation of the reference genome sequences of the goat 
and sheep (Dong et al., 2013; Jiang et al., 2014), comparative 
genomic analysis of wild versus domesticated species to 
elucidate the origin and domestication of chickens and dogs 
(Wang et al. 2014), parallel evolution of dog and human 
genomes (Wang et al., 2013), and high-plateau adaptation of 


domestic animals (Wu et al., 2018). A detailed study on 
genomic selection and adaptation in animals has deepened 
our understanding of evolutionary genotype-phenotype 
systems biology (eGPS) (Wang et al., 2019), which is the core 
idea of the CAS Strategic Priority Research Program 
"Evolutionary Analysis and Functional Regulation of Animal 
Complex Traits" chaired by KIZ researchers. Based on eGPS, 
one would expect that the evolution and genetic basis of 
complex traits could be elucidated, and we could predict 
certain phenotypes based on the selected genes/loci or 
genetic manipulation of these loci (Wang et al., 2019). 
Because of its outstanding research on animal evolution and 
genetics, KIZ was approved by CAS to establish the first CAS 
Center for Excellence in Western China—"CAS Center for 
Excellence in Animal Evolution and Genetics". This center is 
hosted by KIZ, with essential input from other CAS institutes 
across China, and has the ambitious goal to become one of 
the top research centers in evolution and genetics. 

The protection of wild animal diversity and sustainable 
utilization of natural resources are key questions that KIZ 
continues to tackle. For instance, KIZ's wetland restoration 
model using indigenous species from Dianchi Lake has 
proven to be successful at controlling water pollution and 
protecting the endangered golden-line barbel 
(Sinocyclocheilus graham), as reported in Science news 
(Stone, 2008). A new breed of S. graham with very good 
nutritional components, high growth rate, and disease 
resistance was fostered during the reproduction and breeding 
of this endangered species. This serves as a good example 
for the protection and sustainable use of animal resources. 
The profound efforts expended by KIZ researchers in 
protecting rare and endangered species, e. g., Hoolock 
tianxing (Fan et al., 2017) and green peafowl (Pavo muticus) 
(Wu et al., 2019), have been instrumental in arousing both 
public and government attention. 

KIZ research into biodiversity has increasingly focused on 
making sound policy recommendations for protection as well 
as regional land and water usage. With the introduction of 
meta-barcoding for biodiversity using cutting-edge sequencing 
technologies, KIZ researchers put forward the concept of 
"biodiversity soup" for evaluating biodiversity (Ji et al., 2013; 
Yu et al., 2012). Similarly, the project "Study on Biodiversity 
Evolution and Protection of the Himalayas in China" chaired 
by KIZ has shed light on the biodiversity evolution in these 
areas, which is critical for implementing effective measures for 
the sustainable utilization of animal resources. Furthermore, 
the Cold Code project signifies KIZ's central role in the 
development of the International Barcode of Life (iBOL) for 
amphibians and reptiles (Murphy et al, 2013). Recently, 
together with Dr. Jonathan Baillie, renowned KIZ researcher 
Prof. Ya-Ping Zhang discussed the key question regarding 
how much space we should leave for other forms of life 
(Baillie & Zhang, 2018), stating that they "encourage 
governments to set minimum targets of 3096 of the oceans 
and land protected by 2030, with a focus on areas of high 
biodiversity and/or productivity, and to aim to secure 5096 by 
2050". This information will undoubtedly have positive impacts 
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on policy recommendations. 

There are many more amazing achievements over the past 
60 years that we wish could be shared. It is a credit to the 
efforts of every colleague that KIZ has evolved from a small 
working station with only a few dozen staff to an internationally 
recognized research institute with over 450 colleagues and 
400 students. Given its deep roots in the Indo-Burma 
biodiversity hotspot of southwestern China and the devotion of 
its staff, we expect KIZ will continue to grow and realize a 


promising future. 


Yong-Gang Yao, Director General 


Kunming Institute of Zoology, Chinese Academy of Sciences, 
Kunming Yunnan 650223, China 
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ABSTRACT 


Ruminants (Ruminantia) are among the most 
successful herbivorous mammals, exhibiting wide- 
ranging morphological and ecological characteristics 
(such as headgear and multichambered stomach) 
and including various key livestock species (e. g., 
cattle, buffalo, yak, sheep, and goat). Understanding 
their evolution is of great significance not only in 
Scientific research but also in applications potential 
for human society. The rapid growth of genomic 
resources provides unprecedented opportunities to 
dissect the evolutionary histories and molecular 
mechanisms underlying the distinct characteristics of 
ruminants. Here we summarize our current 
understanding of the genetic, morphological, and 
ecological diversity of ruminants and provide 
prospects for future studies. 
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INTRODUCTION 


Ruminantia mammals are among the most successful 
herbivores, and include six families: i. e., Antilocapridae, 





Open Access 

This is an open-access article distributed under the terms of the 
Creative Commons Attribution Non-Commercial License  (http:// 
creativecommons. org/licenses/by-nc/4.0/), which permits unrestricted 
non-commercial use, distribution, and reproduction in any medium, 
provided the original work is properly cited. 

Copyright €2019 Editorial Office of Zoological Research, Kunming 
Institute of Zoology, Chinese Academy of Sciences 


476 Science Press 


Bovidae, Cervidae, Giraffidae, Moschidae, and Tragulidae. 
Ruminantia comprises at least 200 extant species, with 
Bovidae the most species-rich, consisting of at least 143 
species (Heller et al., 2013), including important livestock such 
as cattle (Bos taurus), yak (Bos grunniens), sheep (Ovis 
aries), and goat (Capra hircus). Ruminants are distributed 
across extensive habitats, including different latitudes (from 
tropical to Arctic regions), different altitudes (from plains to 
plateaus), and different ecological environments (from deserts 
to rainforests). Moreover, ruminants exhibit several distinct 
anatomical features, such as headgear and multichambered 
stomach. Compared with other herbivores, such as horses, 
the rumen and omasum in ruminants enable more efficient 
utilization of plant cellulose (Clauss & Róssner, 2014; Janis, 
1976; Russell & Rychlik, 2001). This efficient procurement of 
food energy may be an important reason why ruminants are 
So prosperous. Ruminants also possess specialized dentition, 
speed, and considerable variation in body size. Furthermore, 
they have played an important role in human civilization due 
to the domestication of several species, such as cattle, 
buffalo, yak, sheep, and goat. In modern society, these 
domestic animals remain important for human diet and 
utilization. Therefore, understanding the evolution and origin 
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of ruminants is of great importance for scientific research and 
for application prospects in human society. 

In the past decade, we have witnessed the rapid 
development of genome sequencing technologies and a 
resultant wealth of knowledge from animal genomes. To date, 
the genomes of 73 ruminant species have been reported 
(Table 1), with increasing availability of resequencing and 
transcriptome data for representative species, like cattle 
(Chen et al., 2018; Daetwyler et al., 2014; Medugorac et al., 
2017; Soubrier et al., 2016), goat (Alberto et al., 2018; Wang 
et al., 2016), sheep (Alberto et al., 2018; Hu et al., 2019; 
Naval-Sanchez et al., 2018), and yak (Qiu et al., 2015; Wang 
et al., 2019a). Most previous studies have only focused on the 
evolution of individual species by decoding individual 
genomes and utilizing large-scale resequencing data; e. g., 


high-altitude adaptation studies on yak and Tibetan antelope 
(Ge et al., 2013; Qiu et al., 2012; Qiu et al., 2015). Nowadays, 
large-scale sequencing studies of multiple species across 
high classification orders have provided extraordinary 
biological discoveries, e.g., avian phylogenomics (Jarvis et al., 
2014). We recently launched the Ruminant Genome Project 
and conducted a large-scale genomic study of ruminants 
based on integrated analysis of 51 ruminant genomes (44 
newly assembled genomes and seven public genomes) and 
corresponding transcriptome, resequencing, and functional 
experiments (Chen et al., 2019; Lin et al., 2019; Wang et al., 
2019b). The availability of large-scale genomic resources 
should provide new insights into our understanding of 
ruminant diversity, evolution, biogeographic patterns, and 
adaptation mechanisms. 

















Table 1 Currently available genome assemblies for ruminant species 
Common Genome  Scaffold Contig Assembly 
Family Subfamily Species Reference 
name size(Gb) N50 (bp) N50 (bp) level 
Tragulus javanicus Java mouse- 2.59 14082842 80230 Scaffold u 
deer 
Tragulidae 
Tragulus kanchil Lesser mouse- 3.05 243 497 13 850 Scaffold Chen et al., 2019 
deer 
Antilocapridae Antilocapra americana Pronghorn 2.96 18845065 61 698 Scaffold Chen et al., 2019 
Giraffa camelopardalis Giraffe 2.47 3 153 258 25 056 Scaffold Chen et al., 2019 
Giraffidae Giraffa tippelskirchi Masai giraffe 2.71 212 164 47 894 Scaffold Agaba et al., 
2016 
Okapia johnstoni Okapi 2.88 111 538 39 571 Scaffold Chen et al., 2019 
Cervinae Axis porcinus Hog deer 2.68 20 764 858 172761 Scaffold é 
Cervinae Cervus albirostris White-lipped 2.69 3 769 372 39 627 Scaffold Chen et al., 2019 
deer 
Cervinae Cervus elaphus Red deer 3.44 107 358 006 7 944 Chromosome Bana et al., 2018 
Cervinae Elaphurus davidianus Pere David’s 2.58 2 844 142 59 950 Scaffold Zhang et al., 
deer 2018 
Hydropotinae Hydropotes inermis Chinese water 2.53 13818 975 131 446 Scaffold Wang et al., 
deer 2019 
Muntiacinae Muntiacus crinifrons Black muntjac 2.68 1 305 444 8 265 Scaffold Chen et al., 2019 
Cervidae Muntiacinae Muntiacus muntjak Indian muntjac 2.7 1 258 210 23 470 Scaffold Chen et al., 2019 
Muntiacinae Muntiacus reevesi Chinese 2.6 1 221 377 72 382 Scaffold Chen et al., 2019 
muntjac 
Odocoileinae Capreolus capreolus Western roe 2.79 10 458 4 167 Scaffold Kropatsch et al., 
deer 2013 
Odocoileinae Odocoileus hemionus Mule deer 2.34 838 758 113 295 Scaffold Russell et al., 
2019 
Odocoileinae Odocoileus virginianus White-tailed 2.38 850 721 122 019 Scaffold Seabury et al., 
deer 2011 
Odocoileinae Rangifer tarandus Reindeer 2.9 89 062 77 671 Scaffold Li et al., 2017 
Moschus berezovskii Forest musk 2.81 2 509 225 57 706 Scaffold Chen et al., 2019 
deer 
Moschidae Moschus chrysogaster Alpine musk 4.97 100 428 3 769 Scaffold Wang et al., 
deer 2019 
Moschus moschiferus Siberian musk 3.07 11728851 34785 Scaffold * 


deer 
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. Common Genome  Scaffold Contig Assembly 
Family Subfamily Species Reference 
name size(Gb) N50 (bp) N50 (bp) level 
Aepycerotinae Aepyceros melampus ` Impala 2.63 344 542 82 459 Scaffold Chen et al., 2019 
Alcelaphinae Alcelaphus buselaphus Hartebeest 3.88 12 034 889 Scaffold Chen et al., 2019 
Alcelaphinae Beatragus hunteri Hirola 2.7 69 303 57 444 Scaffold * 
Alcelaphinae Connochaetes taurinus Blue 2.64 366 224 70 608 Scaffold Chen et al., 2019 
wildebeest 
Alcelaphinae” Damaliscus lunatus Topi 3.13 1 166 796 38 843 Scaffold Chen et al., 2019 
Antilopinae Procapra przewalskii Przewalski’s 2.69 5 522 907 10 405 Scaffold Chen et al., 2019 
gazelle 
Antilopinae Antidorcas marsupialis Springbok 3.02 694 905 42 749 Scaffold Chen et al., 2019 
Antilopinae Eudorcas thomsonii Thomson’s 2.9 1 581 717 137 476 Scaffold Chen et al., 2019 
gazelle 
Antilopinae Litocranius walleri Gerenuk 2.98 3 126 223 62 351 Scaffold Chen et al., 2019 
Antilopinae Madoqua kirkii Kirk’s dik-dik ^ 2.65 27 730 27 722 Scaffold Chen et al., 2019 
Antilopinae Nanger granti Grant’s gazelle 3.03 528 456 7 454 Scaffold Chen et al., 2019 
Antilopinae Neotragus moschatus Suni 2.68 952 090 59 277 Scaffold Chen et al., 2019 
Antilopinae Neotragus pygmaeus Royal antelope 2.9 363 895 30 751 Scaffold Chen et al., 2019 
Antilopinae Oreotragus oreotragus Klipspringer 3.25 339 390 42 127 Scaffold Chen et al., 2019 
Antilopinae Ourebia ourebi Oribi 2:52 1 259 1 259 Scaffold Chen et al., 2019 
Antilopinae Raphicerus campestris Steenbok 3.14 537 161 25 050 Scaffold Chen et al., 2019 
Antilopinae Saiga tatarica Saiga antelope 2.88 6 453 6 406 Scaffold 2 
Bovinae Bison bison American 2.83 7 192 658 19 971 Scaffold $ 
bison 
Bovinae Bison bonasus European 2.58 4 690 000 14 530 Scaffold Wang et al., 
Bovidae bison 2017a 
Bovinae Bos frontalis Gayal 2.85 2 737 757 14 405 Scaffold Wang et al., 
2017b 
Bovinae Bos grunniens Domestic yak — 2.83 114 386 978 44716 738 Chromosome Qiu etal., 2012 
Bovinae Bos indicus Zebu cattle 2.67 106 310 653 28 375 Chromosome Canavez etal., 
2012 
Bovinae Bos indicus x Bos Hybrid cattle 2.68 104 466 507 26764281 Chromosome * 
taurus 
Bovinae Bos mutus Wild yak 2.65 1 407 960 22 822 Scaffold t 
Bovinae Bos taurus Cattle 2.72 103 308 737 25 896116 Chromosome Bovine Genome 
et al., 2009 
Bovinae Bubalus bubalis Water buffalo 2.66 117 219 835 22441509 Chromosome Mintoo et al., 
2019 
Bovinae Syncerus caffer African buffalo 2.93 2 316 376 52 316 Scaffold Chen et al., 2019 
Bovinae Tragelaphus buxtoni Mountain nyala 3.27 7 367 1 286 Scaffold Chen et al., 2019 
Bovinae Tragelaphus eurycerus Bongo 2.96 12 574 1974 Scaffold Chen et al., 2019 
Bovinae Tragelaphus imberbis Lesser kudu 4.04 1775 046 12 858 Scaffold Chen et al., 2019 
Bovinae Tragelaphus oryx Common eland 2.84 4 043 025 1 262 Scaffold Chen et al., 2019 
Bovinae Tragelaphus scriptus Bushbuck 2.87 890 554 28 350 Scaffold Chen et al., 2019 
Bovinae Tragelaphus spekii Sitatunga 3.73 46 427 9477 Scaffold Chen et al., 2019 
Bovinae Tragelaphus Greater kudu 2.88 511 483 33 649 Scaffold Chen et al., 2019 
strepsiceros 
Caprinae Ammotragus lervia Barbary sheep 2.65 1 301 762 52 017 Scaffold Chen et al., 2019 
Caprinae Capra aegagrus Wild goat 2.83 91317560 19347 Chromosome Dong et al., 2015 
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Common Genome Scaffold Contig Assembly 
Family Subfamily Species Reference 
name size(Gb) N50 (bp) N50 (bp) level 
Caprinae Capra hircus Goat 2.92 87 277 232 26244591 Chromosome Bickhart et al., 
2017 
Caprinae Capra ibex Alpine ibex 2.7 61905114 380983 Scaffold Chen et al., 2019 
Caprinae Capra sibirica Siberian ibex 2.73 15190720 376582 Scaffold * 
Caprinae Hemitragus hylocrius Nilgiri tahr 2.72 85 340 66 552 Scaffold * 
Caprinae Ovis ammon Argali 2.64 72 289505 151 569 Scaffold Chen et al., 2019 
Caprinae Ovis aries Sheep 2.87 107 697 089 2572683 Chromosome Jiang etal., 2014 
Caprinae Ovis canadensis Bighorn sheep 2.86 69 397 55 973 Scaffold * 
Caprinae Pseudois nayaur Bharal 2.58 21385 20 979 Scaffold Chen et al., 2019 
Cephalophi- Cephalophus harveyi — Harvey's 2.82 365 466 61 368 Scaffold Chen et al., 2019 
nae duiker 
Bovidae Cephalophi- Philantomba maxwellii Maxwell’ s 3.15 383 899 11 377 Scaffold Chen et al., 2019 
nae duiker 
Cephalophi- Sylvicapra grimmia Common 3.15 583 330 9 720 Scaffold Chen et al., 2019 
nae duiker 
Hippotraginae Hippotragus niger Sable antelope 2.6 4 586 323 45 501 Scaffold Koepfli et al., 
2019 
Hippotraginae Oryx gazella Gemsbok 2.76 1579 191 19 498 Scaffold Chen et al., 2019 
Pantholopinae Pantholops hodgsonii ^ Tibetan 24 2 772 860 18 674 Scaffold Ge etal., 2013 
antelope 
Reduncinae Kobus ellipsiprymnus Defassa 2.9 779 552 26 169 Scaffold Chen et al., 2019 
waterbuck 
Reduncinae Redunca redunca Bohor 2.72 423 407 39 382 Scaffold Chen et al., 2019 
reedbuck 


*: These genomes can be accessed from the NCBI website (https://www.ncbi.nlm.nih.gov/assembly/). 


In this review, we first summarize current knowledge of 
ruminant evolution learned from the genomic era, especially 
phylogenetic relationships, genetic basis and evolution of 
complex traits, genetic basis underlying local adaptations, and 
domestication of livestock ruminants. We then provide an 
outlook for future research on ruminants given the rapid 
advances in the accumulation of genomic resources and 
increasing efforts in functional genomics. 


PHYLOGENETIC RELATIONSHIPS OF RUMINANTS 


Phylogeny is essential to biology not only because it reflects 
the evolutionary history of species or lineages and divergent 
events, but also provides a framework for tracing the evolution 
of distinct complex characteristics. In previous studies, 
phylogenetic trees of ruminants were constructed based on 
morphological data (e.g., teeth and skeletons), fragments of 
nuclear DNA, or complete mitochondria DNA (Bibi, 2013; 
Decker et al., 2009; Gatesy et al., 1992; Hassanin & Douzery, 
2003; Hassanin et al., 2012). However, controversies in these 
phylogenetic trees remain, resulting in different placements at 
the genus, subfamily, and even family level. To resolve these 
issues, Chen et al. (2019) provided a robust phylogenetic tree 
based on whole genome data from 51 ruminants, clarifying 


most of the controversies in ruminant phylogeny. Here, using 
all available genomes (Table 1), we reconstructed a simplified 
phylogenetic tree (Figure 1). In previous studies, 
Antilocapridae was considered an outgroup to all other 
pecorans (all ruminants, excluding Tragulidae) based on 
mtDNA (Bibi, 2013; Hassanin et al, 2012; Hernández 
Fernández & Vrba, 2005). Moschidae was placed at the base 
of Pecora (Janis, 1987; Janis & Theodor, 2014; Webb & 
Taylor, 1980) or as a sister group to Bovidae based on 
morphological data (Sánchez et al., 2010), or proposed as a 
sister group of Cervidae or Bovidae based on mtDNA (Bibi, 
2014; Dos Reis et al., 2012; Hassanin & Douzery, 2003; 
Hassanin et al., 2012). The whole genome tree confirms the 
sister-group relationship between Antilocapridae and 
Giraffidae, and Moschidae as a sister group with Bovidae. In 
addition, some controversies in Bovidae (Hernandez 
Fernández & Vrba, 2005) are also resolved, with Reduncinae 
confirmed as a sister group of Caprinae, Alcelaphinae, and 
Hippotraginae. 

The above phylogenic controversies may be due to 
incomplete linage sorting (ILS) or to introgression between 
linages. A high level of phylogenetic heterogeneity is observed 
at the family level of ruminants, showing that only 21.5% of 
sample genomic windows are consistent with the topology of 
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Figure 1 Phylogenetic relationship among ruminants 
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Phylogenic tree of ruminants is presented with species within same families and subfamilies collapsed. Species used in this tree are listed in Table 


1, and the phylogenetic relationship is mainly based on Chen et al. (2019). 


the whole genome tree (Chen et al., 2019). These 
phylogenetic disagreements can primarily be explained by 
ILS, resulting from the rapid radiation of ruminant families 
during the Miocene (Chen et al., 2019). In addition, a strong 
gene flow signal has been reported among Giraffidae, 
Cervidae, Bovidae, and Moschidae (Chen et al., 2019), 
implying ancestral introgressions may also partially account 
for the phylogenic debates. In particular, pervasive 
introgression among Bos species has been reported from 
other studies (Wang et al., 2018a; Wu et al., 2018), which 
provides support for introgression effects on the phylogenetic 
relationships of ruminants. 


EVOLUTION OF IMPORTANT TRAITS 


Most traits are consequences of long-term evolution and form 
the basis of biodiversity. As one of the most successful 
mammalian linages, ruminants have evolved extensive and 
unique morphological traits, such as headgear, 
multichambered stomach, substantial variation in body size, 
cold (Arctic) adaptation (reindeer), and high-altitude 
adaptation. The availability of large-scale genomic data, 
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together with transcriptomics and functional genomics, has 
enabled investigations into the genetic basis and evolution of 
complex traits in ruminants. 


Evolution of headgear 

Ruminants are the only mammalian group with osseous 
headgear (Davis et al., 2011). Headgear is not only used in 
self-defense against predators but also in intraspecies 
competition for mates and territories (Bubenik & Bubenik, 
2012; Davis et al., 2011). Four families of ruminants have 
headgear, including Giraffidae, Antilocapridae, Cervidae, and 
Bovidae. Headgear in different families exhibits distinct 
morphological characteristics, such as ossicones in giraffi ds, 
pronged horns in pronghorns, antlers in cervids, and horns in 
bovids (Bubenik & Bubenik, 2012). The genetic mechanisms 
involved in headgear have long been a topic of interest to 
evolutionary biologists (Aristotle, 1991; Goss, 1983). 

The expression profiles, phylogenies, and convergent 
headgear losses support a single evolutionary origin of 
ruminant headgear (Chen et al., 2019; Wang et al., 2019b). 
The highly or specially expressed genes in horns and antlers 
are mostly co-expressed in bone, skin, nerve tissue, and 
testis, and many positively selected genes are associated with 


conserved elements involved in neural functions (Wang et al., 
2019b). The headgear in ruminants likely originated from 
neural crest stem cells, which diverged into horns, antlers, 
ossicones, and pronged horns in the bovids, cervids, giraffids, 
and pronghorns, respectively. The loss of headgear in 
Moschidae and Hydropotinae was likely due to convergent 
pseudogenization of the gene RXFP2 (Wang et al., 2019b). 

Interestingly, antlers in the Cervidae have evolved further 
remarkable features. Professor Richard Goss from the 
University of Brown wrote, "The antlers of deer are so 
improbable that if they had not evolved in the first place they 
would never have been conceived even in the wildest 
fantasies of the most imaginative biologists" (Goss, 1983). 
Deer antlers are the only completely regenerative organ found 
in mammals. They exhibit extremely rapid growth rates (~1.7 
cm/day in red deer) that can even surpass cancerous tissue 
growth in cell proliferation (Goss, 1983), although deer 
experience low rates of cancer (Griner, 1983; Lombard & 
Witte, 1959). Wang et al. (2019b) showed that antler-specific 
expressed genes are enriched in the axon guidance pathway, 
suggesting that antler growth may be involved in neural 
processes. Strikingly, their results also revealed that proto- 
oncogenes (FOS, REL, FAM83A) and tumor suppression 
genes are positively selected in cervids, thus explaining the 
high growth rate of deer antlers but low cancer rate in cervids. 
Understanding the genetic basis of rapid antler regeneration 
could provide new insights into the regeneration of 
mammalian organs and oncogenesis. 


Evolution of multichambered stomach 

The ruminant digestive system plays important ecological and 
functional roles in evolution. The appearance of the rumen 
and its close interaction with microorganisms allows ruminants 
to gain energy efficiently, thus providing unique evolutionary 
advantages and promoting the prosperity and diversity of 
ruminant groups. Compared with non-ruminants (e.g., horses), 
ruminants can achieve higher utilization of plant fiber due to 
their unique forestomach fermentation system (Janis, 1976). 
Most previous studies have focused on the interaction 
between microorganisms and the rumen (Bovine Genome 
Sequencing and Analysis Consortium et al., 2009; Seshadri et 
al., 2018; Stewart et al., 2018; Zhang et al., 2016). However, 
little is known about the origin and evolution of the 
multichambered stomach. 

To investigate the origin and evolution of the ruminant 
stomach, Chen et al. (2019) compared gene expression 
profiles from 516 samples covering 50 tissues of sheep, 
together with large-scale genomic data. The expression 
profiles suggested that the rumen, reticulum, and omasum 
may have originated from the esophagus, as also suggested 
by Warner (1958) and Xiang et al. (2016), whereas the 
abomasum was closer to the intestine. Newly evolved genes 
also seem to have played an important role in the evolution of 
the rumen. Seven newly evolved genes identified at the 
ancestor of ruminants are specially expressed in the rumen, 
two of which, PRD-SPRRII and TCHHL2, are structural genes 
(Chen et al., 2019; Jiang et al., 2014). PRD-SPRRII is related 


to the cornification of the keratin-rich surface of the rumen, 
and TCHHL2 plays a role in cross-linking keratins at the 
rumen surface (Jiang et al., 2014). The omasum is a newly 
evolved organ in pecorans, resembling the rumen in structure 
and function (Millen et al., 2016). Among the 75 genes 
specifically and highly expressed in the omasum compared 
with other organs, one gene is newly evolved 
(LOC101107119) and another (SCNN1D) exhibits pecoran- 
specific amino acid changes (Chen et al., 2019). In addition, 
four genes (i.e., SIM2, PAX9, KCNK5, and DENND2C) show 
pecoran-specific conserved non-exonic elements (CNEs) 
within their immediate upstream/downstream 10 kb regions 
(Chen et al., 2019). Based on gene family analysis, the 
lysozyme c family is also expanded in ruminants, containing 
ten or more copies (Chen et al., 2019; Irwin, 2015), whereas 
other outgroup mammals have only one or a few copies. 
Furthermore, most expanded copies of lysozyme c are 
expressed in the abomasum, suggesting adaptation to its 
microbe-rich environment. 


Evolution of body size 

Ruminants exhibit extensive variation in body size, from 2 kg 
(e.g., lesser mouse deer, Tragulus kanchil) to 1 200 kg (e.g., 
giraffe, Giraffa camelopardalis and African buffalo, Syncerus 
caffer) (Castelló, 2016). At the family level, body size is largest 
in Giraffidae, followed by medium-sized Antilocapridae and 
Cervidae, and relatively small-sized Tragulidae and 
Moschidae (Clauss et al., 2003). Bovidae exhibits varied body 
size, with Bovinae generally being the largest (Castelld, 
2016). Ruminants living in open grasslands also tend to be 
larger, which can help in resisting predators and adapting to 
changes in the environment. Ruminants living in the jungle 
tend to be smaller in size, which allows for more flexible 
movement and thus predator avoidance. 

Genes associated with body size have significantly higher 
non-synonymous substitution rates in large- and small-sized 
bovid species (Chen et al., 2019). Specifically, six genes (i.e., 
CXCL13, RNF115, NPNT, KL, SLC9A3R1, and MSTN) are 
associated with increased body size and five genes (i. e., 
SBDS, BMP3, LRRN3, NFATC3, and SMARCAL 1) are related 
to reduced body size via regulation of bone and muscle 
development. Several genes also have functional support 
from other studies. For example, SLC9A3R7 plays a role in 
osteogenesis by mineralizing osteoblasts, with disruption of 
this gene resulting in reduced body weight in mice (Shenolikar 
et al, 2002). MSTN regulates muscle cell growth and 
differentiation (Sartori et al., 2013), and mutations in MSTN 
can affect muscle mass in livestock species such goat, sheep, 
and cattle (Grobet et al., 1997; Li et al., 2016; Luo et al., 2014; 
Wang et al., 2018b) and other mammals (Bi et al., 2016; Gu et 
al., 2016). SBDS serves an important role in cell proliferation 
and is a causal gene of Shwachman-Diamond syndrome in 
humans (Boocock et al., 2003), which is characterized by 
short stature and skeletal abnormalities. For individual 
species, the special stature of giraffes enables access to food 
and is an adaptation to tropical grassland. Those genes 
associated with bone development with  giraffe-specific 
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mutations are involved in the transforming growth factor-b 
(TGF-b), Hedgehog, Notch, Wnt, and FGF signaling pathways 
(Agaba et al., 2016; Chen et al., 2019). 


Reindeer adaptation to Arctic regions 

Reindeer (Rangifer tarandus) are naturally distributed across 
Arctic and sub-Arctic regions, and consequently face 
numerous challenges, including severe cold, limited food 
availability, and prolonged periods of light or darkness (Blix, 
2016). To adapt to these environments, reindeer have evolved 
several strategies, including special lipid metabolism to avoid 
heat loss and to induce circadian arrhythmicity for long 
periods of light or dark (Lu et al., 2010; Stokkan et al., 2007; 
Van Oort et al., 2005). Based on detailed comparative 
genomic analysis, Lin et al. (2019) identified two positively 
selected genes (i. e., CYP27B1 and POR) involved in the 
vitamin D metabolism pathway. Functional experiments have 
also shown that enzymes encoded by these two genes exhibit 
much higher catalytic activity than that of their orthologs in 
goat and roe deer (Lin et al., 2019). Eight genes with reindeer- 
specific mutations have been reported to be involved in the 
circadian rhythm pathway, with four being rapidly evolving 
genes. Among these genes, Pro1172Thr (P1172T) mutation in 
PER2 results in loss of binding ability with CRY1, which can 
cause arrhythmicity (Lin et al., 2019). 


Adaptation to high altitude 

Ruminants are distributed across a wide range of habitats, 
including high-altitude regions such as the Tibetan Plateau, 
with risks of hypobaric hypoxia. These ruminants include the 
yak, Tibetan sheep, and Tibetan goat, which are three key 
livestock species serving as sources of meat and fiber for 
Tibetan inhabitants (Hu et al., 2019; Qiu et al., 2012; Wang et 
al., 2016). In addition, the Tibetan antelope (Pantholops 
hodgsonii) and Marco Polo sheep (Ovis ammon polii) also 
exhibit adaptation to high altitude (Ge et al., 2013; Yang et al., 
2017). Therefore, understanding the genetic basis and 
mechanisms underlying high-altitude adaptation is of great 
value in many scientific fields. 

Genetic variation related to transcription factors, oxygen 
sensors, and target genes has been exploited to detect 
signals of selection in ruminant species such as the Tibetan 
sheep (Hu et al., 2019), Tibetan goat (Wang et al., 2016), yak 
(Qiu et al., 2012), and Tibetan antelope (Ge et al., 2013). In 
yak, three genes, identified as positively selected in response 
to hypoxia, are involved in hypoxia-inducible factor-1a (Hif- 
1a), including two important regulators (Adam17 and Arg2) 
and one target gene (Mmp3) (Qiu et al., 2012). In Marco Polo 
sheep, several genes associated with hypoxia response have 
been identified, including ryanodine receptor 1 (RYR1) and 
purinergic receptor P2X and ligand-gated ion channel 3 
(P2RX3) (Yang et al., 2017). In addition to whole genome 
analysis in yak, Marco Polo sheep, and Tibetan antelope, 
resequencing analysis between low- and  high-altitude 
ruminant populations has also identified candidate genes 
associated with hypoxia response, especially the HIF-1 
pathway (Hu et al., 2019; Wang et al., 2016), including one 
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gene (NOXA1) in goats (Wang et al., 2016) and eight genes 
(EPO, TLR4, PIK3CA, PRKCA, EGLN3, EGLN2, IFNGR2, 
and CUL2) with strong signals in sheep (Hu et al., 2019). 
Although candidate genes under positive selection can 
contribute to high-altitude adaptation, introgression may also 
play a crucial role. For instance, genes involved in the 
response-to-hypoxia pathway (e. g., EGLN1, EGLN2 and 
HIF3a) are introgressed from yak to Tibetan cattle, which may 
be responsible for their high-altitude adaptation (Wu et al., 
2018). Of these genes, EGLN7 is reported to be associated 
with high-altitude adaptation in Tibetans (Simonson et al., 
2010). Therefore, these positively selected genes may provide 
valuable insights into plateau medicine. 


DOMESTICATION EFFECTS ON RUMINANT LIVESTOCK 


The importance of ruminants is also highlighted by several 
major livestock species, including goat, sheep, cattle, and yak. 
Domestication of livestock ruminants was a key factor in the 
transition from — hunter-gather societies to agricultural 
civilizations. Under intense artificial selection, domesticated 
livestock species have experienced extensive morphological 
or physiological changes compared to their wild relatives, 
including shadow color coat, floppy ears, shorter muzzles, 
smaller brain and cranial capacity, and tameness, which are 
referred to as domestication syndromes (Wilkins et al., 2014). 

It has been hypothesized that domestication syndromes 
predominantly resulted from mild neural crest cell (NCC) 
deficits during embryonic development (Wilkins et al., 2014). 
Several recent studies have revealed that selected candidate 
regions under domestication of cattle, yak, sheep, and goat 
are enriched in pathways related to brain development or 
neurobehavioral functioning, such as NCCs (Alberto et al., 
2018; Naval-Sanchez et al., 2018; Qanbari et al., 2014; Qiu et 
al., 2015). For example, neural genes (e. g., TMEM132D, 
CACNA1C, NRXN1, and NPAS3) show signals of positive 
selection in cattle (Qanbari et al., 2014). Thirty genes in yak 
show signals of selection involved in brain and neuronal 
development, 19 of which are associated with behavior (Qiu et 
al., 2015). In addition, five genes are associated with the 
nerve system in sheep and goat, which showed convergent 
evolution (Alberto et al., 2018). These genes associated with 
the brain and nerve system under pressure from artificial 
selection may explain tameness or aggressiveness in 
livestock species. 

In addition to major livestock species, reindeer were also 
domesticated by sub-Arctic people. Although cervid species 
are usually cautious and sensitive, reindeer are very tame and 
are the only domesticated species in the cervid family. Lin et 
al. (2019) identified reindeer-specific mutations in genes 
associated with NCCs to explain their tameness, and 
observed that genes involved in development (MSX2, ID3, 
BCAT1, CAD6, and CAD11), migration (TCOF1, BCAT1, 
NOTCH2, and NOTCH3), and differentiation (COL2A1, KIT, 
and S/) of NCCs were either rapidly evolving or demonstrated 
reindeer-specific mutations. 


PERSPECTIVES 


Traditional biological methods are often unable to solve the 
genetic basis of the evolution of complex traits across higher 
taxa at the family, order, or even class level. To address this 
issue, we propose a new approach called evolutionary 
genotype-phenotype systems biology (eGPS), which 
compares the evolution of many species using large-scale 
genome, transcriptome, regulatome, | metabolome, and 
proteome data and systematically integrates evidence from 
genetic factors, developmental network evolution, and 
phenotypes to analyze the genetic basis of complex animal 
traits, especially those with evolutionary significance. Using 
this approach, we could resolve the genetic basis of complex 
traits and determine key regulatory genes or pathways. 
Functional experiments could verify these genetic factors, and 
thus facilitate further investigation on the evolutionary 
mechanisms of complex traits and their potential application. 
The Ruminant Genome Project has served as a good 
example of eGPS study with large-scale multi-omics data. 

The rapid development of sequencing technology, 
especially long-read and Hi-C technologies, has enabled 
researchers to obtain high-quality reference genomes with 
long continuous contigs and scaffold size, even to the 
chromosome level. High-quality assemblies should promote 
additional studies on ruminants, including chromosome 
evolution, accurate identification of new functional elements (e. 
g., regulatory elements and new genes), and functional 
validation using gene editing technology such as CRISPR/ 
Cas9. Chromosome numbers in Muntiacus species 
demonstrate considerable variations due to chromosome 
fusion; for example, Muntiacus reevesi has a karyotype of 2n= 
46, Muntiacus vaginalis has a karyotype of 2n=6 and 7, and 
Muntiacus crinifrons has a karyotype of 2n=8 and 9 in females 
and males, respectively (Shi, 1983; Wurster & Benirschke, 
1967, 1970). High-quality reference genomes will help to 
reveal the genetic mechanisms underlying the recurrent fusion 
of chromosomes in muntjacs and its effects on the interaction 
between genes flanking the fusion regions, and thus facilitate 
understanding of speciation driven by chromosome variation. 
New regulatory elements and genes may be crucial in 
adaptive phenotypic diversification. Availability of high-quality 
genome data should facilitate identification of these new 
functional elements, thus revealing the potential genetic basis 
underlying adaptations. For casual or functional mutations 
identified using large-scale data among linages, CRISPR/ 
Cas9 experiments could be utilized for verification using 
model organisms or key livestock species, such as sheep and 
goats. 

Additionally, genomic resources and comparative analyses 
of ruminants are likely to contribute to animal husbandry 
(Figure 2). Comparative analysis between domestic livestock 
species and wild species has identified a series of candidate 
genes associated with domestication or economically valuable 
traits, such as meat quality and milk production (Alberto et al., 
2018). These artificially selected genes can provide new 
targets for animal breeding and improvement. In the future, 


gene editing technology (such as CRISPR/Cas9) may be 
applicable to quickly improve traits of livestock. 
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Figure 2 Possible applications of future ruminant research 
Studies on ruminants could be applied in cancer research, 
regenerative biology, hypertension, and cardiovascular research, as 
well as breeding of livestock. 


In addition to application in agriculture, ruminants could 
serve as animal models for medical research and thus provide 
valuable insights into medical treatments and even clinical 
studies (Figure 2). Further research on deer antlers may offer 
unprecedented clues on regenerative biology and 
oncogenesis. Moreover, study on giraffes may provide insight 
into hypertension or cardiovascular research. Overall, 
increasing our understanding of the genetic basis and 
mechanisms underlying certain ruminant traits could benefit 
human society in many ways. 
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ABSTRACT 


The discovery of antibiotics marked a golden age in 
the revolution of human medicine. However, 
decades later, bacterial infections remain a global 
healthcare threat, and a return to the pre-antibiotic 
era seems inevitable if stringent measures are not 
adopted to curb the rapid emergence and spread of 
multidrug resistance and the indiscriminate use of 
antibiotics. In hospital settings, multidrug resistant 
(MDR) pathogens, including carbapenem-resistant 
Pseudomonas aeruginosa,  vancomycin-resistant 
enterococci (VRE), methicillin-resistant 
Staphylococcus aureus (MRSA), and extended- 
spectrum B-lactamases (ESBL) bearing 
Acinetobacter baumannii, Escherichia coli, and 
Klebsiella pneumoniae are amongst the most 
problematic due to the paucity of treatment options, 
increased hospital stay, and exorbitant medical 
costs. Antimicrobial peptides (AMPs) provide an 
excellent potential strategy for combating these 
threats. Compared to empirical antibiotics, they 
show low tendency to select for resistance, rapid 
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killing action, broad-spectrum activity, and 
extraordinary clinical efficacy against several MDR 
strains. Therefore, this review highlights multidrug 
resistance among nosocomial bacterial pathogens 
and its implications and reiterates the importance of 
AMPs as next-generation antibiotics for combating 
MDR superbugs. 
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INTRODUCTION 


A perturbing prediction by the World Health Organization 
(WHO) is that by the year 2050, drug-resistant infections, 
largely exacerbated by the indiscriminate use of antibiotics, 
will kill 10 million people annually, ignite a financial cataclysm, 
and impose extreme poverty upon millions of people (de 
Kraker et al., 2016). The increasing incidence and prevalence 
of antibiotic resistance among nosocomial bacterial pathogens 
and the rapid spread of resistance genes in the environment 





Received: 23 July 2019; Accepted: 26 September 2019; Online: 26 
September 2019 

Foundation items: This work was supported by the National Natural 
Science Foundation of China (21761142002 and 31801975), Chinese 
Academy of Sciences (XDB31000000, SAJC201606, KFZD-SW-219- 
2, KFJ-BRP-008, and KGFZD-135-17-011), and Yunnan Province 
Grant (2015HA023) 

"Corresponding author, E-mail: zhangzhiye@mail.kiz.ac.cn 

DOI: 10.24272/j.issn.2095-8137.2019.062 


Zoological Research 40(6): 488-505, 2019 


are major global healthcare threats due to the associated 
increase in morbidity and mortality and huge burden on the 
economy (Aslam et al., 2018; Zaman et al., 2017). 

Since the discovery of penicillin in 1928 by Alexander 
Fleming, antibiotics have saved countless lives and marked 
an important medical revolution in plant, animal, and human 
prophylaxis. Unfortunately, the continued use of antibiotics 
has been accompanied by the rapid emergence and spread of 
multidrug resistant (MDR) strains and many medical 
specialists are warning of an inevitable return to the pre- 
antibiotic era (Adedeji, 2016). For example, bacterial genomic 
analysis has revealed the presence of more than 20 000 
genes associated with resistance (Liu & Pop, 2009), with a 
higher number expected over the coming years. Therefore, 
stringent measures are necessary to curb the spread of 
resistant strains as they present a major challenge to global 
public health not only in the war against microbial infections, 
but also in other clinical applications such as cancer 
treatment, invasive surgery, and graft transplantation (Gudiol 
& Carratala, 2014; Lupei et al., 2010). 

Several factors that have been implicated in the current 
upsurge of antimicrobial resistance (AMR) in hospitals and the 
community, including the indiscriminate use of antibiotics in 
human and animal medicine and in agricultural practices 
involving growth promoters (Silveira et al., 2009; van Boeckel 
et al., 2015), lack of proper regulation regarding over-the- 
counter antibiotics, especially in developing countries where 
they are easily available without proper medical prescription 
(Ayukekbong et al., 2017), and poor sanitation practices 
leading to the introduction of unmetabolized antibiotics into 
the environment through human and animal waste (Davies & 
Davies, 2010). 

Natural selection is an inherent process and key driver of 
evolution, conferring organisms with traits for increased 
environmental adaptability and survival (Jesus et al., 2002). 
Over the years, the widespread use of antibiotics has led to 
the selection of diverse strains of microorganisms possessing 
MDR traits or genes (Martinez & Baquero, 2002; Ochoa et al., 
2009). Such resistance-conferring properties, through 
bacterial and genomic associations, can be easily transferred 
across different ecological niches, e. g., from animals to 
humans and vice versa, and from hospital settings to the 
community, where they present an unprecedented crisis 
(Groisman & Ochman, 1996; Kümmerer, 2004; Phillips et al., 
2004). 

In bacteria, mechanisms conferring resistance to almost all 
available classes of antibiotics have been studied extensively 
and described in detail in previous literature (Blair et al., 2014; 
Dever & Dermody, 1991; Lin et al., 2015; Lombardi et al., 
2019; Morita et al, 2012). These mechanisms include 
enzymatic degradation of antibiotics, drug-target modification, 
altered membrane permeability, and enhanced expression of 
efflux pumps that actively eliminate antibiotics (Alanis, 2005; 
Laxminarayan & Brown, 2001; Munita & Arias, 2016). 

Initially associated with severe nosocomial infections in 
immunocompromised patients, multidrug resistance has 
spread to the wider community, resulting in severe infections 


associated with growing death tolls and huge economic 
burdens due to increased disability and high medical costs 
(Jing et al., 2019; Peters et al., 2019). Currently, common 
problematic MDR bacteria include methicillin-resistant S. 
aureus (MRSA), vancomycin-resistant MRSA, MDR P 
aeruginosa, carbapenem-resistant A. baumannii, E. coli, and 
K. pneumoniae, vancomycin-resistant enterococci (VRE), and 
extensively drug-resistant (XDR) Mycobacterium tuberculosis 
(Levin et al., 1999; Miller et al., 2005). Recent reports have 
also indicated cases of bacterial strains completely resistant 
to all available antibiotics. For example, the extensive use of 
colistin, a drug of last resort for the treatment of MDR 
pathogens such as P aeruginosa, A. baumannii, and K. 
pneumoniae in both human medicine and agriculture, has led 
to the emergence of a plasmid-mediated MCR-1 gene that 
encodes for its resistance (Liu et al., 2016; MacNair et al., 
2018; Paterson & Harris, 2016). 

The increasing incidence of infections resulting from MDR 
pathogens in clinical settings has intensified the demand for 
alternative therapies. Antimicrobial peptides (AMPs) with 
potent antimicrobial activities and diverse mechanisms of 
action (MOA) are considered important alternatives to solving 
the issues of multidrug resistance. 


IMPLICATIONS OF MULTIDRUG RESISTANCE IN 
NOSOCOMIAL PATHOGENS 


Epidemiological surveillance data worldwide indicate that 
anomalous use of antibiotics has resulted in the evolution of 
several human pathogens into MDR strains that are highly 
tolerant or resistant to antibiotic therapies, thus posing a 
serious threat to public health (Davies & Davies, 2010; WHO, 
2014). Multidrug resistance, especially in nosocomial 
pathogens, is of great clinical concern due to the increased 
morbidity and mortality and enhanced virulence and 
transmissibility (Bhat et al., 2006; Khan et al., 2017). Such 
pathogens are implicated in severe infections such as 
ventilator-associated pneumonia (Koenig & Truwit, 2006) as 
well as bloodstream (Martinez & Wolk, 2016), surgical site 
(Anderson, 2011), and implant-associated urinary tract 
infections (Nicolle, 2014). These infections occur often and 
are severe in immunocompromised patients, although recent 
evidence also suggests the spread of MDR genes into the 
general community (Khan et al., 2017). 

For gram-negative nosocomial pathogens, especially A. 
baumannii, P. aeruginosa, K. pneumoniae, S. aureus, and 
Enterobacter spp., the emergence of multidrug resistance to 
several available classes of antibiotics, such as penicillins, 
aminoglycosides, cephalosporins, and fluoroquinolones, has 
increased over the years as a result of extensive use, 
especially in intensive care units (Richard et al., 1994; 
Struelens, 1998). In many cases, these MDR strains show 
reduced susceptibility to all available antibiotic therapies and 
are implicated in serious high mortality rate-nosocomial 
infections (Hirsch & Tam, 2010; Manchanda et al., 2010). 
Colistin, an AMP consisting of two polypeptides (polymyxin A 
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and B), is often used as a drug of last resort for MDR strains, 
notwithstanding its adverse side effects such as nephrotoxicity 
and neurotoxicity, due to the paucity of treatment options 
(Falagas et al., 2005; Yamamoto et al., 2018). However, 
current reports indicate the emergence of MDR strains that 
exhibit reduced susceptibility to colistin following long-term 
clinical or laboratory exposure (Jeannot et al., 2017). 
Resistance to colistin is attributed to the presence of the MCR- 
1 gene, which is highly transmissible across different bacterial 
strains and has been isolated from strains on hospital 
surfaces and in animal and clinical human samples 
(Yamamoto et al, 2018). These strains also exhibit low 
susceptibility to almost all other antibiotics, including those 
designated as "last-resort" such as polymyxins (colistin) and 
carbapenems, thus raising a new crisis in the war against 
multidrug resistance (Liu et al., 2016; Rapoport et al., 2016; 
Yamamoto et al., 2018). 

Gram-positive S. aureus is another important nosocomial 
pathogen and is transmitted through direct contact with 
contaminated surfaces, clinical waste, open wounds, and 
clinical staff (Denis, 2017; Price et al., 2017). It is a major 
cause of skin infections, sepsis, and pneumonia in 
hospitalized patients, with recent data indicating increased 
incidences of community-acquired S. aureus infections due to 
improper disposal of hospital waste, poor hygiene, and the 
spread of resistant genes to the community (Dotel et al., 
2017). The rising incidence of MRSA infections is a growing 
healthcare threat associated with increased mortality rates in 
hospitals and the community (Othman et al, 2019). 
Additionally, most MRSA strains exhibit resistance to all other 
antibiotics, including the beta-lactams, and recent reports 
indicate the emergence and spread of strains with reduced 
susceptibility to glycopeptide compounds such as vancomycin 
(vancomycin-resistant MRSA), therefore making treatment 
almost impossible (Bhat et al., 2006; Centers for Disease & 
Prevention, 1997; Othman et al., 2019). 

Over the past few years, enterococci, which are normal flora 
of the gut and genital tract, have become problematic 
nosocomial pathogens and a growing clinical predicament. 
This trend is concomitant with their inherent resistance to 
several commonly used drugs, including penicillin, ampicillin, 
cephalosporin, and clindamycin (Gold & Moellering, 1996). In 
addition to inherently acquired resistance, enterococci can 
rapidly acquire resistance to virtually any antibiotic either 
through rapid mutation or the acquisition of foreign genetic 
material, thus expressing a repertory of resistance 
mechanisms to antibiotics such as enhanced expression of 
efflux pumps, modification of drug targets, and enzymatic 
degradation of drug agents (Linden, 2002). MDR enterococci 
exhibit high adaptability under antibiotic pressure and can 
rapidly acquire resistant genes to enhance their survival. One 
notable example is the rapid transfer of resistant genes 
associated with the extensive use of vancomycin among 
several strains of enterococci, especially Enterococcus 
faecalis and Enterococcus faecium, two important nosocomial 
pathogens (Arias & Murray, 2012). Resistance to vancomycin 
presents a major clinical crisis as most of these strains are 
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also resistant to most other antibiotics and the transfer of 
vancomycin-resistant genes can occur from enterococci 
strains to even more lethal pathogens such as MRSA, 
therefore leaving very few or no therapeutic options (Chang et 
al., 2003; Franchi et al., 1999; Noble et al., 1992). 

Bacterial biofilms are of great clinical significance to global 
healthcare due to their important role in nosocomial and 
implant-related infections. Most nosocomial pathogens 
produce biofilms, which complement disease pathogenicity 
and resistance (Dunne, 2002; Gurung et al., 2013). Biofilm- 
producing organisms such as A. baumannii and P. aeruginosa 
exhibit extreme resistance to almost all antibiotics compared 
with non-biofilm-producing microorganisms. Biofilms form a 
protective coating around bacterial cells, thus hindering the 
killing action of antibiotics (Stewart, 2002). Other factors that 
contribute to resistance in biofilm-producing organisms 
include the formation of persister cells (dormant and highly 
resilient to almost all available antibiotics), slow bacterial 
growth within the biofilm, and adaptability to stressful 
conditions (Keren et al., 2012; Lewis, 2008). 

Multidrug resistance to useful classes of antibiotics has 
increased gradually among several bacterial nosocomial 
pathogens. Thus, great efforts have been expended on the 
discovery of novel antibiotic alternative therapies. Currently, 
several AMPs with potent efficacy are under clinical trial and 
present excellent mitigation strategies for the multidrug 
resistance crisis (de Breij et al., 2018; Koo & Seo, 2019; 
Stefania et al., 2015). Moreover, compared with traditional 
antibiotics, AMPs possess many important qualities that make 
them excellent candidates for therapeutic exploitation. For 
instance, AMPs are multifunctional with diverse MOA, 
including membrane disruption, inhibition of DNA and protein 
synthesis, and impairment of key cellular processes such as 
metabolism and cell wall synthesis (Kumar et al., 2018; Tzong- 
Hsien et al., 2016; Wimley, 2010). Their diverse MOA are 
important as they minimize the tendency of pathogens to 
select for resistance, as observed in most conventional 
antibiotics that only act on single targets unless used in 
combination (Alanis, 2005; Bechinger & Gorr, 2017). 
Moreover, most AMPs exhibit potent antimicrobial properties 
against both gram-negative and gram-positive bacteria, fungi, 
and viruses in the nanomolar and micromolar range (Frecer et 
al., 2004; Wakabayashi et al., 1996; Yamauchi et al., 1993), 
rapidly kill pathogens within minutes, and have a low proclivity 
to select for resistance compared with conventional antibiotics 
(de Breij et al., 2018; Nagarajan et al., 2019). 


AMPs: ORIGIN AND PROSPECTS 


AMPs or host defense peptides are a notably class of 
compounds of the innate immune system with both 
microbicidal and immunomodulatory activities, providing a first 
line of defense against pathogenic invasion (Falanga & 
Galdiero, 2018; Kang et al., 2017; Pasupuleti et al., 2012). 
AMPs are polypeptides of varying molecular weight and 
amino acid residues (ranging from five to over 100) and are 


found in virtually all living organisms, from simple prokaryotes 
to complex eukaryotes (Figure 1). They play important roles in 
the direct killing of infectious agents, viruses, bacteria, fungi, 
and parasites and through the modulation of immune 
processes such as activation and recruitment of immune cells, 
wound healing, angiogenesis, and inflammation (Haney & 
Hancock, 2013; Kumar et al., 2018; Maroti et al., 2011; 
Oppenheim et al., 2003; Zhao et al., 2013). 


© Protists (8) 
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Figure 1 Number of natural and synthetic antimicrobial peptides 
from different kingdoms (total 3 011) as of July 2019 

Data obtained from antimicrobial peptide database http://aps. unmc. 
edu/AP/ (Wang et al., 2016). 


In most organisms, high concentrations of AMPs are 
expressed on surfaces that are constantly exposed to 
pathogens. For example, the mucosal lining and skin of 
vertebrates display broad-spectrum antimicrobial activities 
(Brogden et al., 2003; Rinaldi., 2002). In general, AMPs are 
either produced continuously or up-regulated following 
exposure to a threat (Ganz, 2003). For example, human skin 
is continuously protected by an abundance of psoriasin, 
dermcidin, and lactoferrin, with cathelicidin LL-37 also up- 
regulated following infection (Harder et al., 2013). In bacteria, 
AMPs play an important role in enhancing adaptability and 
survival during antagonistic competition for resources with 
other bacteria occupying the same ecological niche (Kumar et 
al., 2018). Several peptides with potential therapeutic activity 
have been isolated from bacteria and demonstrate potent 
antimicrobial activities against both gram-negative and gram- 
positive bacteria and even fungi. These AMPs are known as 
bacteriocins and play important roles in inhibiting or killing 
antagonistic bacterial strains with no self-harm, thus 
conferring a critical survival strategy (Yang et al., 2014). 
Bacteriocins provide important prospects for the development 
of alternative antibiotic strategies against different bacterial 
pathogens and are also useful in the food industry as 
preservatives (Mills et al., 2011; Silva et al., 2018). Bacteria- 
derived AMPs include nisin, a 34-amino acid residue peptide 
and an important food preservative isolated from Lactococcus 
lactis (Tong et al, 2014), and colistin, isolated from 
Paenibacillus polymyxa and used extensively in agriculture 
and human prophylaxis (Poirel et al., 2017). Colistin is an 
important last-resort peptide drug used against several MDR 
nosocomial pathogens such as MRSA, MDR P aeruginosa, 
and carbapenem-resistant pathogens, although its use is 
somewhat limited due to its side effects (Falagas et al., 2005; 


Levin et al., 1999; MacNair et al., 2018). 

In vertebrates, AMPs can directly kill microorganisms and 
play a role in immunomodulation through the activation and 
recruitment of immune cells during infection (Kumar et al., 
2018; Mahlapuu et al., 2016). Several different classes of 
AMPs, such as cathelicidins and defensins, have been 
isolated and characterized from immune cells, bodily 
secretions, and epithelial tissues of amphibians and marine 
and terrestrial animals (Lu et al., 2008; Wang et al., 2016). For 
example, several AMPs have been identified from amphibian 
skins, where they are produced in glands of the dermal skin 
layer and released following pathogenic exposure, inducing a 
microbicidal action through membrane disruption (Rollins- 
Smith et al., 2005; Woodhams et al., 2007). In humans, 
psoriasin, dermcidin, and lactoferrin are continuously secreted 
by the sweat glands to form an important barrier against 
infection, and cathelicidin LL-37 is secreted following microbial 
introduction (Harder et al., 2013). Nearly 30 cathelicidins have 
been characterized from domestic and wild mammals 
(Ko$ciuczuk et al, 2012), and are stored as inactive 
precursors in neutrophil granules and activated following 
microbial exposure (Treffers et al, 2005). Apart from 
cathelicidins, several other classes of antimicrobial peptides 
have been isolated and identified in vertebrates (Brogden et 
al., 2003; Zhang, 2006; Zhang et al., 2008). These AMPs have 
diverse MOA and play multifunctional roles, including 
modulation of immune responses, prevention of excessive 
tissue damage, alleviation of inflammation, and destruction of 
invading pathogens to mitigate infection onset (Haney & 
Hancock, 2013; Maróti et al., 2011; Oppenheim et al., 2003). 

Several vertebrate AMPs show excellent therapeutic 
potential due to their antibacterial, antiviral, antifungal, and 
anti-inflammatory properties (Oppenheim et al., 2003; Qi et 
al., 2019), whereas others show improved clinical efficacy, 
structural stability, and potent antimicrobial activity against 
several MDR pathogens following modification (Kumar et al., 
2018; Schmidtchen et al., 2014; Zhang et al., 2019). 

In contrast to vertebrates, insects and plants lack an 
adaptive immune system, relying entirely on the innate 
immune system for defensive purposes, with AMPs playing a 
pivotal role in the protection against invading pathogens. 
Several AMPs have been isolated and analyzed from the 
epithelial cells, hemocytes, and hemolymphs of insects 
(Brown et al., 2009; Hancock et al., 2006; Philippe & Reto, 
2005). Peptides isolated from insects are usually cationic and 
kill bacteria through permeabilizing bacterial membranes, and 
further possess potent microbicidal activity either singly or 
synergistically with traditional antibiotics (Yi et al., 2014). 
Notable examples of insect-based AMPs include cecropins, 
defensins, drosocins, and diptericins, which exhibit activities 
against both gram-negative and gram-positive bacteria as well 
as fungi (Mylonakis et al., 2016; Rozgonyi et al., 2009; Wu et 
al., 2018; Zhang & Gallo, 2016). 

AMPs also play important defensive roles in plants (which 
also lack an innate immune system). Several peptides have 
been isolated and characterized from the leaves, roots, seeds, 
and tubers of plants (Tam et al., 2015). Generally, they exhibit 
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high proteolytic, thermal, and chemical stability due to the 
presence of multiple disulfide bonds conserved within their 
rich cysteine residues (Hammami et al., 2009; Hilchie et al., 
2013; Stotz et al., 2009; Tam et al., 2015). Plant AMPs have 
been extensively studied and discussed (Craik, 2012; 
Hammami et al., 2009; Nawrot et al., 2014; Stec, 2006). Much 
like AMPs from vertebrates, plant-based AMPs confer 
protection against invading pathogens through membrane 
disruption and pore formation and by targeting key microbial 
processes such as DNA and protein synthesis (Nawrot et al., 
2014). Notable examples of plant AMPs include plant 
defensins and thionins (Hilchie et al., 2013; Stotz et al., 2009). 
Plant defensins are small cationic peptides with 45-55 amino 
acid residues and exhibit potent antibacterial and fungal 
activities (Gao et al., 2000; Tavares et al., 2008). Thionins are 
cationic peptides with 45—48 amino acid residues and three or 
four disulfide bonds (Stec, 2006). They exhibit potent activities 
against bacteria and fungi and cytotoxicity on animal cells 
(Evans et al., 1989; Fernandez De Caleya et al., 1972). 

Used singly or in combination with traditional therapeutics, 
AMPs are quite effective at combating infectious agents, 
including MDR bacteria. They exhibit potent microbicidal 
activity in the micromolar range, rapid killing action, and low 
selection of resistance, thus constituting an important strategy 
for curbing MDR pathogens (Deslouches & Di, 2017; Loeffler 
et al., 2001; van't Hof et al., 2001). In contrast to empirical 
antibiotics that target single or specific bacterial processes, 
AMPs exhibit multifunctional bacterial killing effects, including 
disruption of the plasma membrane, and also target microbial 
intracellular processes, including inhibition of transcription and 
translation, protein synthesis, and bacterial cell wall formation 
(Gee et al., 2013; Hurdle et al., 2011; Reddy et al., 2004). 

Presently, much effort is being directed toward obtaining 
novel antimicrobial agents with broad-spectrum activity 
against pathogenic microorganisms. Naturally occurring AMPs 
have proven to be invaluable templates for the design and 
synthesis of synthetic AMPs with increased potency, which are 
easier to produce and less sensitive to proteolytic degradation 
(Falanga et al., 2016; Hurdle et al., 2011). 


CLASSIFICATION AND STRUCTURE OF AMPs 


With more than 3 000 sequences reported to date, AMPs are 
often classified based on their secondary structures and MOA. 
Based on structure, the three currently established classes 
include a-helical, B-sheet, and extended-coiled peptides 
(Figure 2) (Falanga & Galdiero, 2018; Lombardi et al., 2019). 
The a-helical AMPs are unordered in aqueous solution but 
once in contact with a biological membrane, they assume an 
amphipathic a-helical structure (Kumar et al., 2018). Important 
a-helical AMPs include magainin, a 23-amino acid residue 
peptide isolated from the skin of the African clawed frog 
(Xenopus laevis), which exhibits highly potent tumoricidal and 
broad-spectrum antimicrobial activities against bacteria, fungi, 
and protozoa due to membrane disruption (Matsuzaki, 1999; 
Zerweck et al., 2017). The cathelicidin family of AMPs are also 
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important members of this group and are found widely in 
nature in all vertebrates (KoSciuczuk et al, 2012) For 
example, LL-37, a prominent cathelicidin in humans, exhibits 
broad-spectrum microbicidal activities against gram-positive 
and gram-negative bacteria by plasma-membrane disruption 
(Bjórstad et al., 2009). LL-37 has been extensively studied 
and used as a template for the design of several synthetic 
AMPs, many of which are currently under clinical trial (de Breij 
et al., 2018; Durr et al., 2006; KoSciuczuk et al., 2012). Other 
a-helical AMPs include aurein from the granular dorsal glands 
of the frog Litoria aurea (Rai & Qian, 2017), melittin from 
honey-bee venom, and cecropin derived from the hemolymph 
of Hyalophora cecropia (Yi et al., 2014). 

The B-sheet peptides, which constitute the largest group of 
AMPs, are produced in many organisms such as marine 
invertebrates, amphibians, plants, and animals (Haney & 
Hancock, 2013; Maróti et al., 2011; Oppenheim et al., 2003; 
Wang et al., 2016) and show both antibacterial and antifungal 
properties (Harwig et al., 1996). In contrast to «a-helical 
peptides, these peptides are ordered in aqueous solution and 
contain conserved cysteine residues that form disulfide bonds 
that enhance structural stability and minimize proteolytic 
degradation (Tzong-Hsien et al., 2016) They exhibit 
boundless therapeutic prospects, e. g. as antifungal, 
antibacterial, antiviral, and anti-inflammatory agents, and 
primarily kill bacteria through plasma-membrane disruption 
(Kumar et al., 2018; Panteleev et al., 2015). Prominent 
members of this family include defensins, protegrins, and 
tachyplesins. Defensins are found in plants, invertebrates, and 
vertebrates and exhibit activity against both gram-positive and 
gram-negative bacteria, fungi, and viruses (Panteleev et al., 
2015; Yamaguchi & Ouchi, 2012). Tachyplesins, a class of 
peptides isolated from hemocytes of the horseshoe crab, 
exhibit strong microbicidal activity, although their use is 
hampered by their potential toxicity to mammalian cells 
(Edwards et al., 2017; Liu et al., 2018; Miyata et al., 1989). 

The third class of AMPs include those with an extended-coil 
structure. These AMPs lack secondary structures present in 
other AMPs such as a helices and B sheets and mostly 
consist of specific amino acid residues including arginine 
(Arg), proline, or tryptophan (Mahlapuu et al., 2016). They 
exhibit broad-spectrum activity against gram-negative bacteria 
via membrane disruption and targeting internal processes and 
also possess antitumor activity (Falla et al., 1996). Important 
extended-coil AMPs include indolicidin, a 13-amino acid 
peptide produced by bovine leukocytes that shows potent 
antimicrobial activity (Falla et al., 1996), as well as tritrpticin 
(Yang et al., 2002) and human salivary histatin (Du et al., 
2017). 


MECHANISM OF ACTION (MOA) OF AMPs 


For AMPs, there is strong evidence suggesting a close 
correlation between their MOA and physical features or 
primary structure, which are characterized by a net positive 
charge and hydrophobicity (Figure 2) (Dathe & Wieprecht, 
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D Peptide Secondary Length (n) Net Hydrophobicity 
structure charge 
Human LL-37 a-helical 37 6 0.201 
SAAP-148 a-helical 24 11 0.301 
Indolicidin Extended loop 13 3 1.069 
S-thanatin B-sheet 21 6 0.222 
LL-37 SAAP-148 Indolicidin S-thanatin 





Figure 2 Structural diversity and helical wheel projections of representative AMPs 

A: a-helical-magainin (PDB ID 2LSA). B: B-sheet-chicken ovo-defensin (PDB ID 2MJK). C: Extended coil-tritrpticin. Images were created with 
Protein Data Bank (PDP) (Bioinformaticsdoi:10.1093/bioinformatics/bty419) (Rose et al. 2018) and visualized with Jmol software. D: Helical 
wheel projections of four representative peptides showing physical properties canonical to all AMPs, including distribution of amino acid residues, 
net charge, and hydrophobicity established to correlate with antimicrobial activity, selectivity, and cytotoxicity. Positively charged residues (polar) are 
represented as blue circles and hydrophobic (nonpolar) residues are yellow circles. Wheels projections, net charge, and hydrophobicity of AMPs 


were generated with HeliQuest webserver (http://heliquest.ipmc. cnrs.fr/). 


1999; Matsuzaki, 1999). The cationic charge influenced by 
amino acid residues such as Arg and lysine (Lys) enhances 
selectivity for negatively charged bacterial plasma membranes 
while exhibiting low electrostatic interactions with the 
uncharged outer layer of the eukaryotic cell membrane 
(Guilhelmelli et al., 2013). Current efforts at improving the 
selectivity for and electrostatic interaction of AMPs with 
bacterial membranes involve the substitution of amino acids 
with positively charged amino acid residues like Arg and Lys 
(Arias et al., 2018; Yeaman & Yount, 2003). For example, Jin 
et al. (2016) reported that substitution of amino acids 
improved antimicrobial efficacy, increased stability, and 
reduced the hemolytic activity of the designed peptide ZY13 
compared to its precursor peptide cathelicidin-BF 15. 

Previous studies have also indicated a strong correlation 
between the cationic charge of an AMP and its antimicrobial 
activity, with data showing improved antimicrobial activity of 
several AMPs against both bacterial and fungal pathogens 
paralleled with an increase in cationic charge (Gagnon et al., 
2017; Hong et al., 2001; Lyu et al., 2016); however, increased 
hemolytic activity has been observed for several AMPs 
following an increase in net charge (Chen et al., 2005; Jiang 
et al, 2008) Secondly, hydrophobicity is an important 
parameter of all AMPs and is attributed to a substantial 
proportion (almost 5096) of hydrophobic residues in peptide 
sequences, such as leucine, valine, isoleucine, alanine, 
methionine, tyrosine, tryptophan, and phenylalanine. 


Hydrophobicity is crucial for peptide selectivity to biological 
membranes and increased hydrophobicity is correlated with 
increased hemolytic activity as highly hydrophobic AMPs 
have a stronger ability to penetrate the hydrophobic core of 
erythrocyte membranes (Chen et al., 2007; Tachi et al., 
2002). 

Understanding the MOA of AMPs is paramount for 
unlocking their full potential as next-generation antibiotics. 
Extensive research has been conducted, and is still ongoing, 
to unveil the MOA of AMPs (Guilhelmelli et al., 2013; Hancock 
& Lehrer, 1998; Ulm et al., 2012). Based on their MOA, AMPs 
can be classified into those that kill through membrane 
disruptive mechanisms and non-membrane __ disruptive 
mechanisms, as illustrated in Figure 3. For the membrane 
disruptive killing action, AMPs produce microbicidal activity by 
targeting and disrupting the bacterial plasma-membrane 
structure, mostly through permeabilization, thus resulting in 
leakage of intracellular content (Huang et al., 2010). An 
important aspect to consider for this MOA is the concentration 
threshold of peptide molecules on the bacterial membrane 
surface for effective permeabilization via interaction of 
positively charged AMP residues with negatively charged 
moieties on the bacterial membrane surface (Chen et al., 
2007; Shai, 2002). To drive this membrane disruptive MOA, 
three models have been postulated, i.e., barrel-stave, carpet- 
like, and toroidal pore models. For all models, the MOA begins 
with the accumulation and organization of AMP molecules 
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parallel to the membrane surface, followed by electrostatic 
interaction between the AMPs' cationic charged residues and 
negatively charged phospholipids on the membrane surface 
(Falanga et al., 2016). 

Typical for a-helical AMPs, the  barrel-stave model 
constitutes the formation of hydrophilic pores on the 
hydrophobic core of the bacteria membrane structure, 
resulting in membrane disruption and leakage of extracellular 
content (Kumar et al., 2018). For the carpet-like mechanism, 
peptide molecules accumulate parallel to the membrane 
surface in a carpet-like fashion, followed by penetration into 
the membrane and disruption of the lipid bilayer (Brogden, 
2005; Falanga et al., 2016; Rodríguez-Vázquez et al., 2014). 
In contrast, toroidal pore models, e. g., magainin peptide, 
cause membrane disruption by perpendicular insertion into 
the lipid bilayer (Shai, 2002; Wimley, 2010). 

Direct killing through non-membrane disruptive mechanisms 
involves targeting microbial processes or physiological 
functions other than the cell membrane, utilizing similar killing 
mechanisms as those by conventional antibiotics such as 
inhibition of cell wall, protein, and DNA synthesis and reduced 
enzymatic activity (Hancock & Rozek, 2002; Hancock & Sahl, 
2006). Firstly, the AMP interacts with the plasma membrane 
before penetrating with or without membrane permeabilization 
(e. g., activity of buforin Il on E. coli) and accumulating 
intracellularly, where it targets and acts on key processes 
such as transcription and translation, protein synthesis, 
enzymatic activity, and microbial death (Brogden, 2005; Park 
et al., 1998). For example, human a-defensin 1, human a- 
defensin 5, human B-defensin 4, and indolicidin all have 
intracellular targets (Falla et al., 1996; Lehrer et al., 1989; 
Sharma & Nagaraj, 2015). 

In addition to direct microbial killing, AMPs are required for 
other immunomodulatory functions. They are secreted by a 


wide range of immune cells, including phagocytes, 
neutrophils, and macrophages, and are involved in the 
mitigation of infection such as controlled secretion of 
proinflammatory cytokines to prevent cytokine storm and 
tissue damage, recruitment and activation of immune cells, 
promotion of angiogenesis, and suppression of excessive 
reactive oxygen species release (Hancock et al., 2012; Hilchie 
et al., 2013; Nicole et al., 2012; Nijnik & Hancock, 2009). For 
example, human cathelicidin LL-37 exhibits 
immunomodulation properties such as chemoattraction and 
activation of various immune cells such as monocytes, 
neutrophils, and mast cells by using formyl peptide receptor- 
like 1 (De et al., 2000; Nijnik & Hancock, 2009). A thorough 
understanding of the antimicrobial and immunomodulatory 
properties of AMPs provides excellent opportunities for the 
discovery and design of novel therapies for bacterial infection 
as well as inflammatory and cardiovascular diseases such as 
atherosclerosis and thrombosis (Bei et al., 2019; Zhang et al., 
2015). 


ACTIVITY OF AMPs AGAINST MDR NOSOCOMIAL 
BACTERIAL PATHOGENS 


The rapid spread of antibiotic resistance presents a daunting 
clinical challenge due to the recalcitrance of pathogens to 
traditional ^ antibiotics, especially among nosocomial 
pathogens. In both developing and developed countries, 
hospital-acquired infections, commonly referred to as 
nosocomial infections, are a growing concern often associated 
with prolonged hospital stay, increased mortality rates, and 
huge economic burden. Therefore, the discovery of novel 
alternative therapies is paramount. Several AMPs have been 
found to show potent microbicidal activity in the micromolar 
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Figure 3 Schematic of membrane disruptive and non-membrane disruptive bacterial killing mechanisms of AMPs 
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range against several bacteria strains, with a low likelihood for 
selection of resistance, therefore creating hope in the war 
against MDR pathogens (Deslouches & Di, 2017; Loeffler et 
al., 2001; van't Hof et al., 2001). 

Natural and synthetic peptides have proven to be effective 
in combating several pathogens, including MDR gram-positive 
and gram-negative bacteria (see Table 1), with continued 
effort aimed at designing peptides with improved therapeutic 


potential and reduced side effects (Chou et al., 2008; Liu et 
al., 2015). Such strategies have focused on improving 
efficacy, structural stability, and protection from proteolytic 
degradation, and include chemical modifications such as 
acetylation, enrichment with non-natural D-amino acids, 
substitution of amino acid residues, and use of delivery 
systems (Nordstróm & Malmsten, 2017; Zhang et al., 2019; 
Zhao et al., 2016). 





Table 1 Select antimicrobial peptides with potent activity against MDR pathogens 
Peptide Sequence Development phase Description 
Human LL-37 LLGDFFRKSKEKIGKE- From human leucocytes, kills bacteria through pore formation and pos- 
FKRIVQRIKDFLRNLVPRTES sesses immunomodulation activities. 
SAAP-148 (de LKRVWKRVFKLLKRY- Pradliniëal LL-37 derivative, shows potent bactericidal activity through membrane 
Breij et al., 2018) WRQLKKPVR permeabilization and wound healing activity. 
Catelid-E KEFRKLKKSVKKRAKEFFK S s 
(Wang et al., 2008) KPRVIGVSIPF ia mike 3 p 
Low hemolytic activity. 
D-OH-CATH30 KFFKKLKNSVKKRAKKFFK- Preclinical Cathelicidin from snake venom, rapidly kills MDR gram-positive and 
(Zhao et al., 2018) KPRVIGVSIPF gram-negative pathogens, with low hemolytic activity and in vivo toxicity. 
Indolicidin ILPWKWPWWPWRR. I Isolated from bovine leucocytes, shows potent bactericidal activity 
(Falla et al., 1996) through pore formation. 
Omiganan (Melo & Indolicidin derivative, broad-spectrum antimicrobial activity, therapeutic 
Castanho, 2007) ERIS: " agent against acne and catheter related infections. 
Ci-MAM-A24 (Fed-  WRSLGRTLLRLSHALK- Preclinical Isolated from Ciona intestinalis, shows potent bactericidal activity 
ders et al., 2010) PLARRSGW-NH, against MRSA, VRE, and MDR P aeruginosa through pore formation. 
Pexiganan GIGKFLKKAKKFGKAF- i Magainin analog, phase III clinical trials for treatment of bacterial infec- 
(Ge et al., 1999) VKILKK-NH, tions and diabetic foot ulcers. Potent antimicrobial activity. 
S-thanatin GSKKPVPIIYCNRRSGKC- u Thanatin derivative, shows improved antimicrobial activity with reduced 
Preclinical hemolytic activity. Potently inhibits gram-negative growth in vitro and in 
(Wu et al., 2011) QRM : : SORA 
vivo and alleviates sepsis in mice. 
A139 (van der GFCWYVCARRNGARVCYR- = Analog of arenicin-à with B-hairpin structure, exhibits potent microbici- 
I Preclinical dal activity against MDR gram-negative pathogens, and excellent can- 
Weide et al., 2019) RCN : "a p. 
didate for in vivo application. 
Synthetic tetra-branched peptide with potent microbicidal activity 
SET-M33 (Van De ro against MDR bacteria and in vivo therapeutic potential. Currently under 
Weide et al., 2019) KKIRVRESANKBA-OH Preclinical development for treatment of sepsis and lung infections (Brunetti et al., 
2016a). 
EC-hepcidin3 APAKCTPYCYPTHDGVFC- Preclinical Cysteine-rich peptide cloned from marine fish. Potent microbicidal ac- 
P GVRCDFQ tivity against S. aureus and Pseudomonas spp. 


Tachyplesin-1 


(Ohta et al., 1992) KWCFRVCYRG ICYRRCR ll 


Cathelicidin AMPs, which are found in humans and other 
animals, are excellent candidates for therapeutic agents. They 
display multifunctional roles, including potent broad-spectrum 
antimicrobial activity against infectious agents such as fungi, 
viruses, and bacteria, and can trigger specific immune 
responses such as activation and recruitment of immune cells 
(Gennaro et al., 1989; KoSciuczuk et al., 2012). The human 
cathelicidin LL-37, expressed in various tissues and 
circulating cells, is the most extensively studied peptide and 
exhibits potent bactericidal activity against bacterial strains 
and fungi (Bjórstad et al., 2009). Furthermore, it has been 


Cationic B-hairpin peptide from horseshoe crab. Potent microbicidal ac- 
tivity against gram-negative and gram-positive bacteria. Use limited by 
high cytotoxicity. 


used as a template for the design of several synthetic 
peptides with improved antibiotic activity against nosocomial 
pathogens, biofilms, and persister cells (Dürr et al., 2006; 
Xhindoli et al., 2016). A notable LL-37 derivative (SAAP-148) 
designed by de Breij et al. (2018) through amino acid 
substitution of the C-terminal chain of LL-37 shows potent 
microbicidal activity (minimum inhibitory concentration (MIC) 
0.4 to 12.8 um) against several ESKAPE pathogens (e.g., E. 
faecium, S. aureus, K. pneumoniae, A. baumannii, P. 
aeruginosa, and Enterobacter species) without selection of 
resistance. This derivative can also eliminate biofilms and 
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persister cells of S. aureus, A. baumannii and P. aeruginosa in 
the micromolar range. 

Cathelicidins make up the bulk of naturally occurring AMPs 
in snake venom. Several snake venom-based AMPs have 
been established, which potently kill both fungal and bacterial 
pathogens (Jin et al., 2016; Wang et al., 2011; Zhang, 2015). 
Zhao et al. (2018) identified a novel 34-amino acid residue 
cathelicidin (OH-CATH) from the king cobra, with its analog D- 
OH-CATH30 found to exhibit potent microbicidal activity (MIC 
1.56 to 12.5 g/mL) against several gram-negative and gram- 
positive bacteria, including MDR A. baumannii, MDR P 
aeruginosa, MRSA, and E. coli, compared to nine commonly 
used antibiotics. Remarkably, bacteria-killing kinetics revealed 
that D-OH-CATH30 rapidly killed E. coli within 6 min and 
exhibited low hemolytic activity on red blood cells at high 
concentration. Furthermore, in vivo toxicity testing using mice 
revealed a high lethal dose, with no deaths observed at high 
concentrations of 160 mg/kg (Li et al., 2012; Zhao et al., 
2018), thus highlighting their potential as excellent therapeutic 
candidates. 

Cathelicidin-BF is a 30-amino acid residue cathelicidin 
isolated from the venom of Bungarus fasciatus and possesses 
potent antimicrobial activity, even against MDR clinical 
isolates (Wang et al. 2008). Its designed and shortened 15- 
amino acid residue peptide cathelicidin-BF15 also shows 
strong efficacy against fungal species such as Candida 
albicans (Jin et al., 2016; Wang et al., 2008) and against 
several bacterial strains including MDR clinical isolates of E. 
coli, P. aeruginosa, and S. aureus through membrane 
permeabilization (Wang et al., 2008), with low hemolytic 
activity on red blood cells and therapeutic potential against 
acne vulgaris (Wang et al., 2011). 

Several AMPs with potent antimicrobial properties have also 
been isolated from amphibians such as salamanders, frogs, 
and toads (Patocka et al., 2018; Liu et al., 2011; Xiao et al., 
2011). Such peptides are viable targets for the development of 
therapeutic agents. For example, Qi et al., (2019) identified 
two novel cathelicidin peptides, OL-CATH-1 and -2, from the 
frog Odorrana livida, which both show potent antimicrobial 
and anti-inflammatory activities. Wang et al. (2013) isolated 
five novel AMPs from Limnonectes kuhlii frog skin secretions, 
which exhibited potent antimicrobial activity against several 
gram-negative and gram-positive bacterial strains, but with 
low hemolytic activity on mammalian cells. 

Other extensively studied cathelicidins include human- B 
defensin 3 (hBD-3), sheep myeloid peptide (AMP-29), rat 
cathelin-related antimicrobial peptide (rCRAMP), and bovine 
myeloid antimicrobial peptide 27  (BMAP-27), which 
demonstrate potent microbicidal activity against pathogenic 
strains such as E. coli, P. aeruginosa, MRSA, and A. 
baumannii, inhibition of biofilm formation, and 
immunomodulation activity (Dhople et al., 2006; Giacometti et 
al., 2004; Guo et al., 2018; Kosciuczuk et al., 2012; Kurosaka 
et al., 2005). 

Besides cathelicidins, other AMPs with diverse structural 
scaffolds and properties have been reported. Indolicidin, a 
natural cationic peptide from bovine neutrophils, also exhibits 
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potent bactericidal activity through membrane 
permeabilization against gram-negative and gram-positive 
nosocomial pathogens such as E. coli, P. aeruginosa, and S. 
aureus (Falla et al., 1996). Several synthesized derivates of 
indolicidin have shown improved antimicrobial activities 
against a wide panel of MDR nosocomial pathogens with low 
MIC values, e.g., RN7-IN10 and RN7-IN9 (Jindal et al., 2015). 
Omiganan, a novel peptide currently under phase Ill clinical 
trials as a therapeutic agent for bacterial-caused acne and 
catheter-associated bloodstream infection (Melo & Castanho, 
2007), also exhibits potent broad-spectrum antimicrobial 
activity (Sader et al., 2004). 

Also currently under preclinical trial, Ci-MAM-A24, a 
synthetic peptide derivative of a peptide precursor isolated 
from Ciona intestinalis, exhibits potent bactericidal activity 
through membrane permeabilization at low concentration (MIC 
«10 ug/mL) against MDR nosocomial pathogens, including 
MRSA, VRE, MDR ^P aeruginosa, ESBL-producing E. coli 
(Fedders et al., 2010). Thus, it is an important candidate as a 
therapeutic agent against MDR nosocomial pathogens. The 
pexiganan peptide, a synthetic 22-amino acid residue analog 
of magainin isolated from the African clawed frog Xenopus 
laevis (Ge et al., 1999; Zasloff, 1987), also exhibits potent 
broad-spectrum bactericidal ^ activity against several 
pathogens at low concentration. It is currently in phase Ill 
clinical trials as an agent for diabetic foot ulcers caused by 
bacterial infections and can rapidly kill pathogens such as P. 
aeruginosa and several other gram-positive and gram- 
negative bacteria, with a low tendency to induce resistance 
selection (Ge et al., 1999; Mahlapuu et al., 2016). However, 
several reports demonstrate selection of resistance to 
pexiganan following long-term laboratory exposure of 
pathogens (Habets & Brockhurst, 2012; Perron et al., 2006). 

Thanatin is another potential therapeutic candidate for MDR 
pathogens. It is a 21-amino acid residue peptide and its 
analog, S-thanatin, exhibits low hemolytic activity and potent 
antimicrobial activity against several strains of gram-negative 
and gram-positive bacteria (Wu et al., 2011). Furthermore, 
with improved therapeutic activity, S-thanatin shows potent 
activity against nosocomial pathogens K. pneumoniae and E. 
coli at very low concentrations, and can alleviate sepsis in 
mice models, thus making it an excellent therapeutic 
candidate (Ding et al., 2009; Wu et al., 2013). 

Colistin is an important AMP used as a last-resort drug for 
the treatment of MDR pathogen infections. Recent reports of 
selection of resistance to colistin thus present a concerning 
global healthcare predicament in the fight against MDR 
bacterial infections (Falagas et al., 2005; MacNair et al., 2018; 
Yamamoto et al., 2018). Remarkably, two novel AMPs (AA139 
and SET-M33) with similar MOA as colistin are currently in 
development and have shown excellent therapeutic potential 
both in vitro against several MDR gram-pathogens and in-vivo 
in animal disease models (van der Weide et al., 2019). AA139 
is a 21-amino acid amphipathic peptide with reduced cytotoxic 
and hemolytic activity. It was designed from arenicin-3, a 
peptide with potent microbicidal activity against several MDR 
bacterial strains and isolated from the marine lugworm 


Arenicola marina (Sierra et al., 2017; Wang et al., 2018). SET- 
M33 is a synthetic AMP exhibiting potent microbicidal activity 
against gram-negative bacteria through membrane disruption 
and an excellent candidate for in vivo application, as 
evidenced from animal models of infectious diseases having 
both anti-inflammatory and immunomodulatory activities 
(Brunetti et al., 2016b; Falciani et al., 2012; van der Weide et 
al., 2017). 

Several AMPs with potent activity against gram-positive and 
gram-negative MDR and showing excellent prospects as 
therapeutic agents have also been isolated and characterized 
from marine animals (Destoumieux-Garzón et al., 2016; Tincu 
& Taylor., 2004). For example, mytimacin-AF, an 80-cysteine- 
rich amino acid residue isolated from marine mollusks, shows 
potent activity against both gram-positive and gram-negative 
bacteria, especially against nosocomial pathogens S. aureus 
and K. pneumoniae (MIC<2 g/mL) (Zhong et al., 2013). EC- 
hepcidin3, cloned from marine fish Epinephelus coioides, is a 
novel cysteine-rich AMP with rapid and potent microbicidal 
activity against S. aureus and Pseudomonasstutzeri (MIC« 
3 pmol/L) (Qu et al., 2013). Tachyplesins, a class of peptides 
isolated from horseshoe crab hemocytes, are cationic B- 
hairpin structured peptides with potent antimicrobial activities 
against MDR gram-negative and gram-positive bacteria at low 
concentrations (Falanga et al. 2016; Liu et al, 2018; 
Nakamura et al., 1988), although with increasing hemolytic 
and cytotoxic activity at higher levels (Edwards et al., 2017; 
Liu et al., 2018; Ohta et al., 1992). 

Continued research on AMPs has yielded a new class of 
highly potent peptides called "Selectively Targeted AMPs" 
(STAMPs), which demonstrate increased sensitivity to target 
pathogens without harming normal flora and increased 
bacterial killing potency and kinetics (Chung & Khanum, 
2017). Such AMPs have been designed to target pathogens 
through specific determinants followed by selective killing 
(Sarma et al., 2018). Notable STAMPs include synthetic 
peptide M8(KH)-20, a multi-headed peptide specifically 
targeting and potently killing P aeruginosa and Streptococcus 
mutans in vitro with very little effect on other pathogens (He et 
al., 2009). Oritavancin, another synthetic STAMP designed 
from the naturally occurring glycopeptide chloroeremoycin and 
currently in clinical development, is reported to be highly 
selective and potently kills MDR pathogens such as methicillin- 
resistant S. aureus and vancomycin-resistant S. aureus 
(VRSA) more rapidly than vancomycin through membrane 
permeabilization and inhibition of cell wall synthesis (Allen & 
Nicas, 2003; Chung & Khanum, 2017). 


CHALLENGES FACING EXPLOITATION OF AMPs AS 
THERAPEUTIC AGENTS AND SOLUTION STRATEGIES 


To be considered for approval as a therapeutic agent, effective 
AMPs need to show broad-spectrum activity, high selectivity 
for bacterial pathogens, and low cytotoxicity on mammalian 
cells (Dathe & Wieprecht, 1999). Currently, various AMPs are 
under clinical trial; however, several factors curtail their full 


utilization and approval as antibiotic alternatives (Falanga et 
al., 2016; Lombardi et al., 2019; Wang et al., 2016). The main 
factor limiting the systemic application of AMPs is their 
sensitivity to proteolytic digestion. For instance, host 
proteolytic enzymes in intestinal mucosa, gastrointestinal 
tract, and blood plasma can readily degrade antimicrobial 
peptides, which negatively impacts their in vivo stability and 
pharmacokinetics (Moncla et al., 2011; Starr & Wimley, 2017). 
Therefore, as a result, many AMPs are limited to topical 
application rather than oral or intravenous administration. 
Moreover, studies have demonstrated that, as a defensive 
mechanism in the presence of AMPs, certain bacteria will up- 
regulate the secretion of proteolytic enzymes and 
metalloproteases (MMPs) that can partially or completely 
degrade AMPs (Lai et al., 2007; Sieprawska-Lupa et al., 
2004). 

Toxicity and efficacy of AMPs is another major drawback of 
their use. Several AMPs have adverse side effects in vivo, 
which limits their use to topical application only (McPhee & 
Hancock, 2005). Moreover, in addition to the low correlation 
between in vitro- and in vivo-based results in some cases, 
physiological conditions such as high salt concentration and 
serum components can negatively affect the antimicrobial 
activity of many AMPs (Cantisani et al., 2014; de Breij et al., 
2018; Han et al., 2016). For most AMPs, antimicrobial activity 
is dependent on the electrostatic interactions between 
positively charged peptides and negatively charged plasma 
membranes (Guilhelmelli et al., 2013). Such interactions can 
be affected by elevated salt levels or binding of AMPs to 
serum components (e. g., lipoproteins) (Li et al., 2017). 
Therefore, efficacy screening of AMPs under different 
physiological conditions and in the presence of host 
components is necessary to affirm their activity. 

Compared to conventional antibiotics, AMPs are also 
expensive to produce. Therefore, current efforts are being 
directed at designing shorter peptides with reduced side 
effects and improved stability under different physiological 
conditions (Kim et al., 2013; Zhao et al., 2015). For example, 
Li et al. (2017) pioneered an important strategy for designing 
AMPs with trypsin inhibitor activity based on a peptide isolated 
from the frog Odorrana grahami. 

Although AMPs have a low likelihood to select for 
resistance, reports already exist detailing the development of 
resistance against some AMPs (Andersson et al., 2016; 
Omardien et al., 2016); for example, the development of 
resistance to colistin by A. baumannii following long-term 
clinical application (Jeannot et al., 2017; Liu et al., 2016). 
Colistin is a last-resort drug used for the treatment of MDR 
nosocomial pathogens, and thus resistance to colistin is an 
important clinical issue (Liu et al., 2016; MacNair et al., 2018; 
Paterson & Harris., 2016). Several mechanisms have been 
reported to be responsible for resistance to AMPs, including 
expression of efflux pumps, surface charge modification to 
impede membrane-peptide electrostatic interactions, and 
increased secretion of proteolytic enzymes (Andersson et al., 
2016; Bechinger & Gorr, 2017; Morita et al., 2012). For 
example, S. aureus can alter surface charge by adding basic 
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amino groups from D-alanine from the cytoplasm to teichoic 
acid on the membrane surface (Peschel et al., 1999). 
Resistance to AMPs also raises other critical issues such as 
the potential development of cross-resistance to antibiotic 
regimens and host AMPs (Conlon, 2015; Kubicek-Sutherland 
et al., 2017), which could create unprecedented detrimental 
consequences. 

Presently, studies are focusing on developing strategies to 
improve the efficacy of AMPs in vivo, enhance selectivity for 
microbial cells while reducing cytotoxicity, increase stability, 
and minimize proteolytic degradation. Chemical modifications 
such as the addition of D-amino acids, cyclization, or 
acetylation are important strategies used to improve the 
stability of AMPs and reduce sensitivity to proteolytic 
degradation (Gao et al., 2018; Zhao et al., 2016). However, 
additional modifications add to production costs. To solve this, 
efforts are currently being directed at the design and synthesis 
of shorter peptides with increased potency and stability and 
reduced toxicity (Jin et al., 2016; Kim et al., 2013). 

The use of delivery systems is another important strategy 
for improving the stability and efficacy of AMPs (Nordstróm & 
Malmsten, 2017). Of particular interest is the use of 
nanocarriers, which are designed and covalently attached to 
AMPs to prevent self-aggregation, improve chemical stability, 
and release profiles of AMPs to target sites. Importantly, such 
nanocarriers are synthesized from materials that are easily 
biodegradable, including lipids (such as triglycerides and 
cholesterol) and polymers such as gels, cellulose, and fiber 
(D'angelo et al., 2015; Mahlapuu et al., 2016; Yüksel et al., 
2016). 


FUTURE PROSPECTS AND CONCLUSIONS 


In this review, we reiterated the importance of AMPs as 
potential next-generation antibiotics to mitigate a wide array of 
microbial infections, including those caused by MDR strains. 
Natural and synthetic AMPs are unique and exhibit boundless 
therapeutic potential. Remarkably, AMPs display diverse 
MOA, with a low tendency to select for resistance, rapid killing 
ability, and multifunctional activities, thus conferring great 
advantages over empirical antibiotics. Extensive research on 
the origin, structure, and biological properties has improved 
our understanding and use of AMPs. Of particular importance 
is the potential of these peptides to act as templates and 
precursor molecules for the development of novel anticancer 
and antimicrobial agents for bacterial, fungal, and viral 
infections where most available therapies have been rendered 
ineffective due to the rapid emergence and spread of 
resistance. Furthermore, these studies allow for the 
generation of models mimicking the interaction of these 
peptides with biological membranes, a key element in their 
microbicidal activity. Such insight is important in the design 
and synthesis of AMPs with increased potency against MDR 
strains and improved chemical stability. 

Presently, substantial effort is being dedicated to the design 
and synthesis of shorter peptides with improved therapeutic 
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activity and reduced cytotoxicity. Such efforts are bearing fruit, 
as evidenced in the presence of short synthetic AMPs with 
potent and rapid microbicidal activity against several MDR 
pathogens, even those showing resistance to last-resort 
drugs, thus presenting hope in the war against multidrug 
resistance. 


COMPETING INTERESTS 


The authors declare that they have no competing interests. 


AUTHORS' CONTRIBUTIONS 


J. M., X: H., R. L., and Y. Z. Z. conceived the review and prepared the 
manuscript. All authors contributed to the discussions. All authors read and 
approved the final version of the manuscript. 


REFERENCES 


Adedeji WA. 2016. The treasure called antibiotics. Annals of Ibadan 
Postgraduate Medicine, 14(2): 56-57. 

Alanis AJ. 2005. Resistance to antibiotics: Are we in the post-antibiotic era?. 
Archives of Medical Research, 36(6): 697—705. 

Allen NE, Nicas TI. 2003. Mechanism of action of oritavancin and related 
glycopeptide antibiotics. FEMS Microbiology Reviews, 26(5): 511—532. 
Anderson DJ. 2011. Surgical site Infections. Infectious Disease Clinics of 
North America, 25(1): 135-153. 

Andersson DI, Hughes D, Kubicek-Sutherland JZ. 2016. Mechanisms and 
consequences of bacterial resistance to antimicrobial peptides. Drug 
Resistance Updates, 26: 43-57. 

Arias CA, Murray BE. 2012. The rise of the enterococcus: beyond 
vancomycin resistance. Nature Reviews Microbiology, 10(4): 266-278. 

Arias M, Piga KB, Hyndman ME, Vogel HJ. 2018. Improving the activity of 
trp-rich antimicrobial peptides by arg/lys substitutions and changing the 
length of cationic residues. Biomolecules, 8(2): 19. 

Aslam B, Wang W, Arshad MI, Khurshid M, Muzammil S, Rasool MH, Nisar 
MA, Alvi RF, Aslam MA, Qamar MU, Salamat MKF, Baloch Z. 2018. 
Antibiotic resistance: a rundown of a global crisis. Infection and Drug 
Resistance, 11: 1645-1658. 

Ayukekbong JA, Ntemgwa M, Atabe AN. 2017. The threat of antimicrobial 
resistance in developing countries: causes and control strategies. 
Antimicrobial Resistance & Infection Control, 6(1): 47. 

Bechinger B, Gorr SU. 2017. Antimicrobial peptides: mechanisms of action 
and resistance. Journal of Dental Research, 96(3): 254—260. 

Bei YH, Pan LL, Zhou QL, Zhao CM, Xie Y, Wu CF, Meng XM, Gu HY, Xu 
JH, Zhou L, Sluijter JPG, Das S, Agerberth B, Sun J, Xiao JJ. 2019. 
Cathelicidin-related antimicrobial peptide protects against myocardial 
ischemia/reperfusion injury. BMC Medicine, 17(1): 42. 

Bhat G, Kamath S, Hussain A. 2006. Nosocomial methicillin--resistant 
Staphylococcus aureus with reduced susceptibility to vancomycin. Indian 
Journal of Pathology & Microbiology, 49(2): 311-312. 

Bjorstad Á , Askarieh G, Brown KL, Christenson K, Forsman H, Onnheim K, 
Li H-N, Teneberg S, Maier O, Hoekstra D, Dahlgren C, Davidson DJ, 
Bylund J. 2009. The host defense peptide LL-37 selectively permeabilizes 
apoptotic leukocytes. Antimicrobial Agents and Chemotherapy, 53(3): 1027— 


1038. 

Blair JMA, Webber MA, Baylay AJ, Ogbolu DO, Piddock LJV. 2014. 
Molecular 
Microbiology, 13: 42-51. 

Brogden KA. 2005. Antimicrobial peptides: pore formers or metabolic 


mechanisms of antibiotic resistance. Nature Reviews 


inhibitors in bacteria?. Nature Reviews Microbiology, 3(3): 238—250. 
Brogden KA, Ackermann M, Mccray Jr PB, Tack BF. 2003. Antimicrobial 
peptides in animals and their role in host defences. /nternational Journal of 
Antimicrobial Agents, 22(5): 465-478. 

Brown SE, Howard A, Kasprzak AB, Gordon KH, East PD. 2009. A 
peptidomics study reveals the impressive antimicrobial peptide arsenal of 
the wax moth Galleria mellonella. Insect Biochemistry and Molecular 
Biology, 39(11): 792-800. 

Brunetti J, Falciani C, Roscia G, Pollini S, Bindi S, Scali S, Arrieta UC, 
Gómez-Vallejo V, Quercini L, Ibba E, Prato M, Rossolini GM, Llop J, Bracci 
L, Pini A. 2016a. In vitro and in vivo efficacy, toxicity, bio-distribution and 
resistance selection of a novel antibacterial drug candidate. Scientific 
Reports, 6: 26077-26077. 

Brunetti J, Roscia G, Lampronti I, Gambari R, Quercini L, Falciani C, Bracci 
L, Pini A. 2016b. Immunomodulatory and anti-inflammatory activity in vitro 
and in vivo of a novel antimicrobial candidate. The Journal of Biological 
Chemistry, 291(49): 25742-25748. 

Cantisani M, Finamore E, Mignogna E, Falanga A, Nicoletti GF, Pedone C, 
Morelli G, Leone M, Galdiero M, Galdiero S. 2014. Structural insights into 
and activity analysis of the antimicrobial peptide myxinidin. Antimicrobial 
Agents and Chemotherapy, 58(9): 5280-5290. 

Centers for Disease Control and Prevention. 1997. Update: staphylococcus 
aureus with reduced susceptibility to vancomycin--united states, 1997. 
mmwr. Morbidity and Mortality Weekly Report, 46(35): 813-815. 

Chang S, Sievert DM, Hageman JC, Boulton ML, Tenover FC, Downes FP, 
Shah S, Rudrik JT, Pupp GR, Brown WJ, Cardo D, Fridkin SK. 2003. 
Infection with vancomycin-resistant Staphylococcus aureus containing the 
vana resistance gene. New England Journal of Medicine, 348(14): 1342— 
1347. 

Chen YX, Guarnieri MT, Vasil Al, Vasil ML, Mant CT, Hodges RS. 2007. 
Role of peptide hydrophobicity in the mechanism of action of alpha-helical 
antimicrobial peptides. Antimicrobial Agents and Chemotherapy, 51(4): 
1398-1406. 

Chen YX, Mant CT, Farmer SW, Hancock REW, Vasil ML, Hodges RS. 
2005. Rational design of alpha-helical antimicrobial peptides with enhanced 
activities and specificity/therapeutic index. The Journal of Biological 
Chemistry, 280(13): 12316-12329. 

Chou HT, Kuo TY, Chiang JC, Pei MJ, Yang WT, Yu HC, Lin SB, Chen WJ. 
2008. Design and synthesis of cationic antimicrobial peptides with improved 
activity and selectivity against Vibrio spp. International Journal of 
Antimicrobial Agents, 32(2): 130—138. 

Chung PY, Khanum R. 2017. Antimicrobial peptides as potential anti-biofilm 
agents against multidrug-resistant bacteria. Journal of Microbiology, 
Immunology and Infection, 50(4): 405—410. 

Conlon JM. 2015. Host-defense peptides of the skin with therapeutic 
potential: from hagfish to human. Peptides, 67: 29-38. 

Craik DJ. 2012. Host-defense activities of cyclotides. Toxins (Basel), 4(2): 
139-156. 

D' angelo |, Casciaro B, Miro A, Quaglia F, Mangoni ML, Ungaro F. 2015. 


Overcoming barriers in Pseudomonas aeruginosa lung infections: 
engineered nanoparticles for local delivery of a cationic antimicrobial 
peptide. Colloids and Surfaces B: Biointerfaces, 135: 717—725. 

Dathe M, Wieprecht T. 1999. Structural features of helical antimicrobial 
peptides: their potential to modulate activity on model membranes and 
biological cells. Biochimica et Biophysica Acta (BBA) - Biomembranes, 
1462(1-2): 71-87. 

Davies J, Davies D. 2010. Origins and evolution of antibiotic resistance. 
Microbiology and Molecular Biology Reviews, 74(3): 417—433. 

de Breij A, Riool M, Cordfunke RA, Malanovic N, De Boer L, Koning RI, 
Ravensbergen E, Franken M, van der Heijde T, Boekema BK, Kwakman 
PHS, Kamp N, El Ghalbzouri A, Lohner K, Zaat SaJ , Drijfhout JW, 
Nibbering PH. 2018. The antimicrobial peptide SAAP-148 combats drug- 
resistant bacteria and biofilms. Science Translational Medicine, 10(423): 
eaan4044. 

de Kraker MEA, Stewardson AJ, Harbarth S. 2016. Will 10 million people 
die a year due to antimicrobial resistance by 2050?. PLoS Medicine, 13(11): 
e1002184. 

De Y, Chen Q, Schmidt AP, Anderson GM, Wang JM, Wooters J, 
Oppenheim JJ, Chertov O. 2000. LL-37, the neutrophil granule-and 
epithelial cell-derived cathelicidin, utilizes formyl peptide receptor-like 1 
(FPRL1) as a receptor to chemoattract human peripheral blood neutrophils, 
monocytes, and T cells. The Journal of Experimental Medicine, 192(7): 
1069-1074. 

Denis O. 2017. Route of transmission of Staphylococcus aureus. The 
Lancet Infectious Diseases, 17(2): 124-125. 

Deslouches B, Di YP. 2017. Antimicrobial peptides: a potential therapeutic 
option for surgical site infections. Clinics in Surgery, 2: 1740. 
Destoumieux-Garzón D, Rosa RD, Schmitt P, Barreto C, Vidal-Dupiol J, 
Mitta G, Gueguen Y, Bachére E. 2016. Antimicrobial peptides in marine 
invertebrate health and disease. Philosophical Transactions of the Royal 
Society B: Biological Sciences, 371(1695): 20150300. 

Dever LA, Dermody TS. 1991. Mechanisms of bacterial resistance to 
antibiotics. JAMA Internal Medicine, 151(5): 886-895. 

Dhople V, Krukemeyer A, Ramamoorthy A. 2006. The human beta-defensin- 
3, an antibacterial peptide with multiple biological functions. Biochimica et 
Biophysica Acta (BBA) - Biomembranes, 1758(9): 1499-1512. 

Ding JX, Wu GQ, Zhao R, Li LX, Shen ZL, Xi T. 2009. Activity of S-thanatin 
thanatin analog against multidrug-resistant gram-negative organisms in 
vitro and the synergy with 7 kinds of antibiotics. Pharmaceutical 
biotechnology, 16(2): 158-161. 

Dotel R, O'sullivan M, Gilbert G. 2017. Staphylococcus aureus in critical 
care. The Lancet Infectious Diseases, 17(6): 579—580. 

Du H, Puri S, Mccall A, Norris HL, Russo T, Edgerton M. 2017. Human 
salivary protein histatin 5 has potent bactericidal activity against eskape 
pathogens. Frontiers in Cellular and Infection Microbiology, T: 41. 

Dunne WM Jr. 2002. Bacterial adhesion: seen any good biofilms lately?. 
Clinical Microbiology Reviews, 15(2): 155-166. 

Dürr UHN, Sudheendra US, Ramamoorthy A. 2006. LL-37, the only human 
member of the cathelicidin family of antimicrobial peptides. Biochimica et 
Biophysica Acta (BBA) - Biomembranes, 1758(9): 1408-1425. 

Edwards IA, Elliott AG, Kavanagh AM, Blaskovich MAT, Cooper MA. 2017. 
Structure-activity and-toxicity relationships of the antimicrobial peptide 
tachyplesin-1. ACS Infectious Diseases, 3(12): 917—926. 


Zoological Research 40(6): 488-505, 2019 499 


Evans J, Wang YD, Shaw KP, Vernon LP. 1989. Cellular responses to 
Pyrularia thionin are mediated by Ca?* influx and phospholipase A2 
activation and are inhibited by thionin tyrosine iodination. Proceedings of 
the National Academy of Sciences of the United States of America, 86(15): 
5849-5853. 

Falagas ME, Kasiakou SK, Saravolatz LD. 2005. Colistin: the revival of 
polymyxins for the management of multidrug-resistant gram-negative 
bacterial infections. Clinical Infectious Diseases, 40(9): 1333-1341. 

Falanga A, Galdiero S. 2018. Emerging therapeutic agents on the basis of 
naturally occurring antimicrobial peptides. The Royal Society of Chemistry, 
42: 190-227. 

Falanga A, Lombardi L, Franci G, Vitiello M, lovene MR, Morelli G, Galdiero 
M, Galdiero S. 2016. Marine antimicrobial peptides: nature provides 
templates for the design of novel compounds against pathogenic bacteria. 
International Journal of Molecular Sciences, 17(5): 785. 

Falciani C, Lozzi L, Pollini S, Luca V, Carnicelli V, Brunetti J, Lelli B, Bindi S, 
Scali S, Di Giulio A, Rossolini GM, Mangoni ML, Bracci L, Pini A. 2012. 
Isomerization of an antimicrobial peptide broadens antimicrobial spectrum 
to gram-positive bacterial pathogens. PLoS One, 7(10): e46259. 

Fala TJ, Karunaratne DN, Hancock REW. 1996. Mode of action of 
the antimicrobial peptide indolicidin. The Journal of Biological Chemistry, 
271(32): 19298-19303. 

Fedders H, Podschun R, Leippe M. 2010. The antimicrobial peptide Ci- 
MAM-A24 is highly active against multidrug-resistant and anaerobic 
bacteria pathogenic for humans. /nternational Journal of Antimicrobial 
Agents, 36(3): 264-266. 

Fernandez De Caleya R, Gonzalez-Pascual B, Garcia-Olmedo F, 
Carbonero P. 1972. Susceptibility of phytopathogenic bacteria to wheat 
purothionins in vitro. Applied Microbiology, 23(5): 998—1000. 

Franchi D, Climo MW, Wong AH, Edmond MB, Wenzel RP. 1999. Seeking 
vancomycin resistant Staphylococcus aureus among patients with 
vancomycin-resistant enterococci. Clinical Infectious Diseases, 29(6): 1566- 
1568. 

Frecer V, Ho B, Ding JL. 2004. De novo design of potent antimicrobial 
peptides. Antimicrobial Agents and Chemotherapy, 48(9): 3349-3357. 
Gagnon MC, Strandberg E, Grau-Campistany A, Wadhwani P, Reichert J, 
Bürck J, Rabanal F, Auger M, Paquin JF, Ulrich AS. 2017. Influence of the 
length and charge on the activity of a-helical amphipathic antimicrobial 
peptides. Biochemistry, 56(11): 1680—1695. 

Ganz T. 2003. The role of antimicrobial peptides in innate immunity1. 
Integrative and Comparative Biology, 43(2): 300—304. 

Gao AG, Hakimi SM, Mittanck CA, Wu Y, Woerner BM, Stark DM, Shah 
DM, Liang J, Rommens CMT. 2000. Fungal pathogen protection in potato 
by expression of a plant defensin peptide. Nature Biotechnology, 18(12): 
1307-1310. 

Gao YD, Fang HT, Fang L, Liu DW, Liu JS, Su MH, Fang Z, Ren WZ, Jiao 
HP. 2018. The modification and design of antimicrobial peptide. Current 
Pharmaceutical Design, 24(8): 904—910. 

Ge Y, Macdonald DL, Holroyd KJ, Thornsberry C, Wexler H, Zasloff M. 
1999. In vitro antibacterial properties of pexiganan, an analog of magainin. 
Antimicrobial Agents and Chemotherapy, 43(4): 782-788. 

Gee ML, Burton M, Grevis-James A, Hossain MA, Mcarthur S, Palombo EA, 
Wade JD, Clayton AHA. 2013. Imaging the action of antimicrobial peptides 
on living bacterial cells. Scientific Reports, 3: 1557-1557. 


500  www.zoores.ac.cn 


Gennaro R, Skerlavaj B, Romeo D. 1989. Purification, composition, and 
activity of two bactenecins, antibacterial peptides of bovine neutrophils. 
Infection and Immunity, 57(10): 3142-3146. 

Giacometti A, Cirioni O, Ghiselli R, Mocchegiani F, D'amato G, Circo R, 
Orlando F, Skerlavaj B, Silvestri C, Saba V, Zanetti M, Scalise G. 2004. 
Cathelicidin peptide sheep myeloid antimicrobial peptide-29 prevents 
endotoxin-induced mortality in rat models of septic shock. American Journal 
of Respiratory and Critical Care Medicine, 169(2): 187—194. 

Gold HS, Moellering RC. 1996. Antimicrobial-drug resistance. New England 
Journal of Medicine, 335(19): 1445-1453. 

Groisman EA, Ochman H. 1996. Pathogenicity islands: bacterial evolution 
in quantum leaps. Cell, 87(5): 791—794. 

Gudiol C, Carratala J. 2014. Antibiotic resistance in cancer patients. Expert 
Review of Anti-infective Therapy, 12(8): 1003-1016. 

Guilhelmelli F, Vilela N, Albuquerque P, Derengowski LDS, Silva-Pereira |, 
Kyaw CM. 2013. Antibiotic development challenges: the various 
mechanisms of action of antimicrobial peptides and of bacterial resistance. 
Frontiers in Microbiology, 4: 353-353. 

Guo YJ, Xun M, Han J. 2018. A bovine myeloid antimicrobial peptide 
(BMAP-28) and its analogs kill pan-drug-resistant Acinetobacter baumannii 
by interacting with outer membrane protein A (OmpA). Medicine (Baltimore), 
97(42): e12832-e12832. 

Gurung J, Khyriem AB, Banik A, Lyngdoh WV, Choudhury B, Bhattacharyya 
P. 2013. Association of biofilm production with multidrug resistance among 
clinical isolates of Acinetobacter baumannii and Pseudomonas aeruginosa 
from intensive care unit. Indian Journal of Critical Care Medicine, 17(4): 
214-218. 

Habets MGJL , Brockhurst MA. 2012. Therapeutic antimicrobial peptides 
may compromise natural immunity. Biology Letters, 8(3): 416-418. 
Hammami R, Ben Hamida J, Vergoten G, Fliss |. 2009. Phytamp: a 
database dedicated to antimicrobial plant peptides. Nucleic Acids 
Research, 37 (Database issue): D963-D968. 

Han HM, Gopal R, Park Y. 2016. Design and membrane-disruption 
mechanism of charge-enriched AMPs exhibiting cell selectivity, high-salt 
resistance, and anti-biofilm properties. Amino Acids, 48(2): 505-522. 
Hancock REW, Brown KL, Mookherjee N. 2006. Host defence peptides 
from invertebrates — emerging antimicrobial strategies. /mmunobiology, 
211(4): 315-322. 

Hancock REW, Lehrer R. 1998. Cationic peptides: a new source of 
antibiotics. Trends in Biotechnology, 16(2): 82-88. 

Hancock REW, Nijnik A, Philpott DJ. 2012. Modulating immunity as a 
therapy for bacterial infections. Nature Reviews Microbiology, 10: 243—254. 
Hancock REW, Rozek A. 2002. Role of membranes in the activities 
of antimicrobial cationic peptides. FEMS Microbiology Letters, 206(2): 
143-149. 

Hancock REW, Sahl HG. 2006. Antimicrobial and host-defense peptides as 
new anti-infective therapeutic strategies. Nature Biotechnology, 24: 1551— 
1557. 

Haney EF, Hancock REW. 2013. Peptide design for antimicrobial and 
immunomodulatory applications. Peptide Science, 100(6): 572-583. 

Harder J, Schróder JM, Glaser R. 2013. The skin surface as antimicrobial 
barrier: present concepts and future outlooks. Experimental Dermatology, 
22(1): 1-5. 

Harwig SSL, Waring A, Yang HJ, Cho Y, Tan L, Lehrer RI. 1996. 


Intramolecular disulfide bonds enhance the antimicrobial and lytic activities 
of protegrins at physiological sodium chloride concentrations. European 
Journal of Biochemistry, 240(2): 352-357. 

He J, Anderson MH, Shi WY, Eckert R. 2009. Design and activity of a 'dual- 
targeted’ antimicrobial peptide. International Journal of Antimicrobial 
Agents, 33(6): 532-537. 

Hilchie AL, Wuerth K, Hancock REW. 2013. Immune modulation by 
multifaceted cationic host defense (antimicrobial) peptides. Nature 
Chemical Biology, 9: 761—768. 

Hirsch EB, Tam VH. 2010. Impact of multidrug-resistant Pseudomonas 
aeruginosa infection on patient outcomes. Expert Review 
Pharmacoeconomics & Outcomes Research, 10(4): 441—451. 

Hong SY, Park TG, Lee KH. 2001. The effect of charge increase on the 
specificity and activity of a short antimicrobial peptide. Peptides, 22(10): 
1669-1674. 

Huang YB, Huang JF, Chen YX. 2010. Alpha-helical cationic antimicrobial 
peptides: relationships of structure and function. Protein & Cell, 1(2): 
143-152. 

Hurdle JG, O'neill AJ, Chopra I, Lee RE. 2011. Targeting bacterial 
membrane function: an underexploited mechanism for treating persistent 
infections. Nature Reviews Microbiolology, 9(1): 62-75. 

Jeannot K, Bolard A, Plésiat P. 2017. Resistance to polymyxins in gram- 
negative organisms. International Journal of Antimicrobial Agents, 49(5): 
526-535. 

Jesus B, Antonio O, Jose-Maria GG. 2002. Mutation and evolution of 
antibiotic resistance: antibiotics as promoters of antibiotic resistance?. 
Current Drug Targets, 3(4): 345-349. 

Jiang ZQ, Vasil Al, Hale JD, Hancock REW, Vasil ML, Hodges RS. 2008. 
Effects of net charge and the number of positively charged residues on the 
biological activity of amphipathic alpha-helical cationic antimicrobial 
peptides. Biopolymers, 90(3): 369-383. 

Jin L, Bai XW, Luan N, Yao HM, Zhang ZY, Liu WH, Chen Y, Yan XW, Rong 
MQ, Lai R, Lu QM. 2016. A designed tryptophan-and lysine/arginine-rich 
antimicrobial peptide with therapeutic potential for clinical antibiotic-resistant 
Candida albicans vaginitis. Journal of Medicinal Chemistry, 59(5): 1791— 
1799. 

Jindal HM, Le CF, Mohd Yusof MY, Velayuthan RD, Lee VS, Zain SM, Isa 
DM, Sekaran SD. 2015. Antimicrobial activity of novel synthetic peptides 
derived from indolicidin and ranalexin against Streptococcus pneumoniae. 
PLoS One, 10(6): e0128532. 

Jing Y, Yue XC, Yang S, Li S. 2019. Association of aspirin resistance with 
increased mortality in ischemic stroke. The Journal of Nutrition, Health & 
Aging, 23(3): 266-270. 

Kang HK, Kim C, Seo CH, Park Y. 2017. The therapeutic applications of 
antimicrobial peptides (AMPs): a patent review. Journal of Microbiology, 
55(1): 1-12. 

Keren I, Mulcahy LR, Lewis K. 2012. Persister eradication: lessons from the 
world of natural products. Methods in Enzymology, 517: 387—406. 

Khan HA, Baig FK, Mehboob R. 2017. Nosocomial infections: epidemiology, 
prevention, control and surveillance. Asian Pacific Journal of Tropical 
Biomedicine, 7(5): 478-482. 

Kim H, Jang JH, Kim SC, Cho JH. 2013. De novo generation of short 
antimicrobial peptides with enhanced stability and cell specificity. Journal of 
Antimicrobial Chemotherapy, 69(1): 121—132. 


Koenig SM, Truwit JD. 2006. Ventilator-associated pneumonia: diagnosis, 
treatment, and prevention. Clinical Microbiogy Reviews, 19(4): 637—657. 
Koo HB, Seo J. 2019. Antimicrobial peptides under clinical investigation. 
Peptide Science, doi: 10.1002/pep2.24122. 

Kosciuczuk EM, Lisowski P, Jarczak J, Strzałkowska N, Jóźwik A, 
Horbanczuk J, Krzyzewski J, Zwierzchowski L, Bagnicka E. 2012. 
Cathelicidins: family of antimicrobial peptides. A review. Molecular Biolology 
Reports, 39(12): 10957-10970. 

Kubicek-Sutherland JZ, Lofton H, Vestergaard M, Hjort K, Ingmer H, 
Andersson DI. 2017. Antimicrobial peptide exposure selects for 
Staphylococcus aureus resistance to human defence peptides. The Journal 
of Antimicrobial Chemotherapy, 72(1): 115-127. 

Kumar P, Kizhakkedathu JN, Straus SK. 2018. Antimicrobial peptides: 
diversity, mechanism of action and strategies to improve the activity and 
biocompatibility in vivo. Biomolecules, 8(1): 4. 

Kümmerer K. 2004. Resistance in the environment. Journal of Antimicrobial 
Chemotherapy, 54(2): 311—320. 

Kurosaka K, Chen Q, Yarovinsky F, Oppenheim JJ, Yang D. 2005. Mouse 
cathelin-related antimicrobial peptide chemoattracts leukocytes using formyl 
peptide receptor-like 1/mouse formyl peptide receptor-like 2 as the 
receptor and acts as an immune adjuvant. The Journal of Immunology, 
174(10): 6257-6265. 

Lai YP, Villaruz AE, Li M, Cha DJ, Sturdevant DE, Otto M. 2007. The human 
anionic antimicrobial peptide dermcidin induces proteolytic defence 
mechanisms in staphylococci. Molecular Microbiology, 63(2): 497—506. 
Laxminarayan R, Brown GM. 2001. Economics of antibiotic resistance: a 
theory of optimal use. Journal of Environmental Economics and 
Management, 42(2): 183-206. 

Lehrer RI, Barton A, Daher KA, Harwig SS, Ganz T, Selsted ME. 1989. 
Interaction of human defensins with Escherichia coli. mechanism of 
bactericidal activity. The Journal of Clinical Investigation, 84(2): 553-561. 
Levin AS, Barone AA, Pengo J, Santos MV, Marinho IS, Arruda EAG, 
Manrique El, Costa SF. 1999. Intravenous colistin as therapy for 
nosocomial infections caused by  multidrug-resistant Pseudomonas 
aeruginosa and Acinetobacter baumannii. Clinical Infectious Diseases, 
28(5): 1008-1011. 

Lewis K. 2008. Multidrug tolerance of biofilms and persister cells. /n: 
Romeo T. Bacterial Biofilms. Berlin, Heidelberg: Springer Berlin Heidelberg, 
107-131. 

Li JG, Koh JJ, Liu SP, Lakshminarayanan R, Verma CS, Beuerman RW. 
2017. Membrane active antimicrobial peptides: translating mechanistic 
insights to design. Frontiers in Neuroscience, 11: 73-73. 

Li SA, Lee WH, Zhang Y. 2012. Efficacy of OH-CATH30 and its analogs 
against drug-resistant bacteria in vitro and in mouse models. Antimicrobial 
Agents and Chemotherapy, 56(6): 3309-3317. 

Lin J, Nishino K, Roberts MC, Tolmasky M, Aminov RI, Zhang LX. 2015. 
Mechanisms of antibiotic resistance. Frontiers in Microbiology, 6: 34. doi: 
10.3389/fmicb.2015.00034. 

Linden PK. 2002. Treatment options for vancomycin-resistant enterococcal 
infections. Drugs, 62(3): 425-441. 

Liu B, Pop M. 2009. ARDB--antibiotic resistance genes database. Nucleic 
Acids Research, 37 (Database issue): DA43-D447. 

Liu CB, Qi JL, Shan B, Ma YB. 2018. Tachyplesin causes membrane 
instability that kills multidrug-resistant bacteria by inhibiting the 3-ketoacyl 


Zoological Research 40(6): 488-505, 2019 501 


carrier protein reductase fabG. Frontiers in Microbiology, 9: 825. doi: 
10.3389/fmicb.2018.00825. 

Liu CB, Shan B, Bai HM, Tang J, Yan LZ, Ma YB. 2015. Hydrophilic/ 
hydrophobic characters of antimicrobial peptides derived from animals and 
their effects on multidrug resistant clinical isolates. Zoological Research, 
36(1): 41-47. 

Liu R, Liu H, Ma YF, Wu J, Yang HL, Ye HH, Lai R. 2011. There are 
abundant antimicrobial peptides in brains of two kinds of Bombina toads. 
Journal of Proteome Research, 10(4): 1806-1815. 

Liu YY, Wang Y, Walsh TR, Yi LX, Zhang R, Spencer J, Doi Y, Tian G, Dong 
B, Huang XH, Yu LF, Gu DX, Ren HW, Chen XJ, Lv LC, He DD, Zhou HW, 
Liang Z, Liu J-H, Shen JZ. 2016. Emergence of plasmid-mediated colistin 
resistance mechanism MCR-1 in animals and human beings in China: a 
microbiological and molecular biological study. The Lancet Infectious 
Diseases, 16(2): 161—168. 

Loeffler JM, Nelson D, Fischetti VA. 2001. Rapid killing of Streptococcus 
pneumoniae with a bacteriophage cell wall hydrolase. Science, 294(5549): 
2170-2172. 

Lombardi L, Falanga A, Del Genio V, Galdiero S. 2019. A new hope: self- 
assembling peptides with antimicrobial activity. Pharmaceutics, 11(4): 166. 
Lu Y, Ma YF, Wang X, Liang JG, Zhang CX, Zhang KY, Lin GQ, Lai R. 2008. 
The first antimicrobial peptide from sea amphibian. Molecular Immunology, 
45(3): 678-681. 

Lupei MI, Mann HJ, Beilman GJ, Oancea C, Chipman JG. 2010. 
Inadequate antibiotic therapy in solid organ transplant recipients is 
associated with a higher mortality rate. Surgical Infections, 11(1): 33-39. 

Lyu YF, Yang Y, Lyu X, Dong N, Shan A. 2016. Antimicrobial activity, 
improved cell selectivity and mode of action of short PMAP-36-derived 
peptides against bacteria and Candida. Scientific Reports, 6: 27258-27258. 
Macnair CR, Stokes JM, Carfrae LA, Fiebig-Comyn AA, Coombes BK, 
Mulvey MR, Brown ED. 2018. Overcoming mcr-1 mediated colistin 
resistance with colistin in combination with other antibiotics. Nature 
Communications, 9(1): 458. 

Mahlapuu M, Hákansson J, Ringstad L, Bjórn C. 2016a. Antimicrobial 
peptides: an emerging category of therapeutic agents. Frontiers in Cellular 
and Infection Microbiology, 6: 194—194. 

Manchanda V, Sanchaita S, Singh NP. 2010. Multidrug resistant 
acinetobacter. Journal of Global Infectious Diseases, 2(3): 291—304. 

Maróti G, Kereszt A, Kondorosi É , Mergaert P. 2011. Natural roles of 
antimicrobial peptides in microbes, plants and animals. Research in 
Microbiology, 162(4): 363-374. 

Martínez JL, Baquero F. 2002. Interactions among strategies associated 
with bacterial infection: pathogenicity, epidemicity, and antibiotic resistance. 
Clinical Microbiology Reviews, 15(4): 647—679. 

Martinez RM, Wolk DM. 2016. Bloodstream infections. /n: Hayden R, Wolk 
D, Carroll K, Tang YW. Diagnostic Microbiology of the Immunocompromised 
Host, 2nd edn. Washington DC: ASM Press. 

Matsuzaki K. 1999. Why and how are peptide-lipid interactions utilized for 
self-defense? Magainins and tachyplesins as archetypes. Biochimica et 
Biophysica Acta (BBA) - Biomembranes, 1462(1-2): 1—10. 

Mcphee JB, Hancock REW. 2005. Function and therapeutic potential of 
host defence peptides. Journal of Peptide Science, 11(11): 677-687. 

Melo MN, Castanho MARB. 2007. Omiganan interaction with bacterial 


membranes and cell wall models. Assigning a biological role to saturation. 


502  www.zoores.ac.cn 


Biochimica et Biophysica Acta (BBA) - Biomembranes, 1768(5): 1277—1290. 
Miller LG, Perdreau-Remington F, Rieg G, Mehdi S, Perlroth J, Bayer AS, 
Tang AW, Phung TO, Spellberg B. 2005. Necrotizing fasciitis caused by 
community-associated methicillin-resistant Staphylococcus aureus in Los 
Angeles. New England Journal of Medicine, 352(14): 1445-1453. 

Mills S, Serrano LM, Griffin C, O'connor PM, Schaad G, Bruining C, Hill C, 
Ross RP, Meijer WC. 2011. Inhibitory activity of Lactobacillus plantarum 
LMG P-26358 against Listeria innocua when used as an adjunct starter in 
the manufacture of cheese. Microbial Cell Factories, 10 (Suppl 1): S7—S7. 
Miyata T, Tokunaga F, Yoneya T, Yoshikawa K, Iwanaga S, Niwa M, Takao 
T, Shimonishi Y. 1989. Antimicrobial peptides, isolated from horseshoe crab 
hemocytes, tachyplesin Il, and polyphemusins | and Il: chemical structures 
and biological activity. The Journal of Biochemistry, 106(4): 663-668. 

Moncla BJ, Pryke K, Rohan LC, Graebing PW. 2011. Degradation of 
naturally occurring and engineered antimicrobial peptides by proteases. 
Advances in Bioscience and Biotechnology, 2(6): 404—408. 

Morita Y, Tomida J, Kawamura Y. 2012. Mexxy multidrug efflux system of 
Pseudomonas aeruginosa. Frontiers in Microbiology, 3: 408. 

Munita JM, Arias CA. 2016. Mechanisms of antibiotic resistance. 
Microbiology Spectrum, 4(2), doi: 10.1128/microbiolspec. VMBF-0016-2015. 
Mylonakis E, Podsiadlowski L, Muhammed M, Vilcinskas A. 2016. Diversity, 
evolution and medical applications of insect antimicrobial peptides. 
Philosophical Transactions of the Royal Society of London. Series B, 
Biological Sciences, 371(1695): 20150290. 

Nagarajan D, Roy N, Kulkarni O, Nanajkar N, Datey A, Ravichandran S, 
Thakur C, T S, Aprameya IV, Sarma SP, Chakravortty D, Chandra N. 2019. 
076: A designed antimicrobial peptide to combat carbapenem-and 
tigecycline-resistant Acinetobacter baumannii. Science Advances, 5(7): 
eaax1946-eaax1946. 

Nakamura T, Furunaka H, Miyata T, Tokunaga F, Muta T, lwanaga S, Niwa 
M, Takao T, Shimonishi Y. 1988. Tachyplesin, a class of antimicrobial 
peptide from the hemocytes of the horseshoe crab (Tachypleus tridentatus). 
Isolation and chemical structure. Journal of Biological Chemistry, 263(32): 
16709-16713. 

Nawrot R, Barylski J, Nowicki G, Broniarczyk J, Buchwald W, Gozdzicka- 
Józefiak A. 2014. Plant antimicrobial peptides. Folia Microbiologica, 59(3): 
181-196. 

Nicole JA, Amy TYY, Olga MP, Robert EWH. 2012. Therapeutic potential of 
host defense peptides in 
Pharmaceutical Design, 18(6): 807—819. 

Nicolle LE. 2014. Catheter associated urinary tract infections. Antimicrobial 


antibiotic-resistant infections. Current 


Resistance and Infection Control, 3: 23-23. 

Nijnik A, Hancock REW. 2009. Host defence peptides: antimicrobial and 
immunomodulatory activity and potential applications for tackling antibiotic- 
resistant infections. Emerging Health Threats Journal, 2: e1. 

Noble WC, Virani Z, Cree RGA. 1992. Co-transfer of vancomycin and other 
resistance genes from Enterococcus faecalis NCTC 12201 to 
Staphylococcus aureus. FEMS Microbiology Letters, 93(2): 195-198. 
Nordstróm R, Malmsten M. 2017. Delivery systems for antimicrobial 
peptides. Advances in Colloid and Interface Science, 242: 17-34. 

Ochoa TJ, Ruiz J, Molina M, Del Valle LJ, Vargas M, Gil Al, Ecker L, 
Barletta F, Hall E, Cleary TG, Lanata CF. 2009. High frequency of 
antimicrobial drug resistance of diarrheagenic Escherichia coli in infants in 


peru. The American Journal of Tropical Medicine and Hygiene, 81(2): 


296-301. 

Ohta M, Ito H, Masuda K, Tanaka S, Arakawa Y, Wacharotayankun R, Kato 
N. 1992. Mechanisms of antibacterial action of tachyplesins and 
polyphemusins, a group of antimicrobial peptides isolated from horseshoe 
crab hemocytes. Antimicrobial Agents and Chemotherapy, 36(7): 1460 — 
1465. 

Omardien S, Brul S, Zaat SAJ. 2016. Antimicrobial activity of cationic 
antimicrobial peptides against gram-positives: current progress made in 
understanding the mode of action and the response of bacteria. Frontiers in 
Cell and Developmental Biology, 4: 111—111. 

Oppenheim JJ, Biragyn A, Kwak LW, Yang D. 2003. Roles of antimicrobial 
peptides such as defensins in innate and adaptive immunity. Annals of the 
Rheumatic Diseases, 62(suppl 2): ii17—21. 

Othman HB, Halim RMA, Gomaa FAM, Amer MZ. 2019. Vancomycin MIC 
distribution among methicillin-resistant Staphylococcus aureus. Is reduced 
vancomycin susceptibility related to MIC creep?. Open Access Macedonian 
Journal of Medical Sciences, 7(1): 12-18. 

Panteleev PV, Bolosov IA, Balandin SV, Ovchinnikova TV. 2015. Structure 
and biological functions of B -hairpin antimicrobial peptides. Acta Naturae, 
7(1): 37-47. 

Park CB, Kim HS, Kim SC. 1998. Mechanism of action of the antimicrobial 
peptide buforin ll: buforin ii kills microorganisms by penetrating the cell 
membrane and inhibiting cellular functions. Biochemical and Biophysical 
Research Communications, 244(1): 253-257. 

Pasupuleti M, Schmidtchen A, Malmsten M. 2012. Antimicrobial peptides: 
key components of the innate immune system. Critical Reviews in 
Biotechnology, 32(2): 143-171. 

Paterson DL, Harris PNA. 2016. Colistin resistance: a major breach in our 
last line of defence. The Lancet Infectious Diseases, 16(2): 132—133. 
Patocka J, Nepovimova E, Klimova B, Wu QH, Kuca K. 2018. Antimicrobial 
peptides: amphibian host defense peptides. Current Medicinal Chemistry, 
25: 1-21. 

Perron GG, Zasloff M, Bell G. 2006. Experimental evolution of resistance to 
an antimicrobial peptide. Proceedings of the Royal Society B: Biological 
sciences, 273(1583): 251-256. 

Peschel A, Otto M, Jack RW, Kalbacher H, Jung G, Gótz F. 1999. 
Inactivation of the dlt operon /nstaphylococcus aureus confers sensitivity to 
defensins, protegrins, and other antimicrobial peptides. Journal of Biological 
Chemistry, 274(13): 8405-8410. 

Peters L, Olson L, Khu DTK, Linnros S, Le NK, Hanberger H, Hoang NTB, 
Tran DM, Larsson M. 2019. Multiple antibiotic resistance as a risk factor for 
mortality and prolonged hospital stay: a cohort study among neonatal 
intensive care patients with hospital-acquired infections caused by gram- 
negative bacteria in Vietnam. PLoS One, 14(5): e0215666. 

Philippe B, Reto S. 2005. Insect antimicrobial peptides: structures, 
properties and gene regulation. Protein & Peptide Letters, 12(1): 3-11. 
Phillips I, Casewell M, Cox T, De Groot B, Friis C, Jones R, Nightingale C, 
Preston R, Waddell J. 2004. Does the use of antibiotics in food animals 
pose a risk to human health? A critical review of published data. Journal of 
Antimicrobial Chemotherapy, 53(1): 28—52. 

Poirel L, Jayol A, Nordmann P. 2017. Polymyxins: Antibacterial activity, 
susceptibility testing, and resistance mechanisms encoded by plasmids or 
chromosomes. Clinical Microbiology Reviews, 30(2): 557—596. 

Price JR, Cole K, Bexley A, Kostiou V, Eyre DW, Golubchik T, Wilson DJ, 


Crook DW, Walker AS, Peto TEA, Llewelyn MJ, Paul J. 2017. Transmission 
of Staphylococcus aureus between health-care workers, the environment, 
and patients in an intensive care unit a longitudinal cohort study 
based on whole-genome sequencing. The Lancet Infectious Diseases, 
17(2): 207-214. 

Qi RH, Chen Y, Guo ZL, Zhang F, Fang Z, Huang K, Yu HN, Wang YP. 
2019. Identification and characterization of two novel cathelicidins from the 
frog Odorrana livida. Zoological Research, 40(2): 94—101. 

Qu HD, Chen B, Peng H, Wang KJ. 2013. Molecular cloning, recombinant 
expression, and antimicrobial activity of EC-hepcidin3, a new four-cysteine 
hepcidin isoform from Epinephelus coioides. Bioscience, Biotechnology, 
and Biochemistry, 77(1): 103-110. 

Rai DK, Qian S. 2017. Interaction of the antimicrobial peptide aurein1.2 and 
charged lipid bilayer. Scientific Reports, 7(1): 3719. 

Rapoport M, Faccone D, Pasteran F, Ceriana P, Albornoz E, Petroni A, 
Corso A. 2016. First description of mcr-1-mediated colistin resistance in 
human infections caused by Escherichia coli in Latin America. Antimicrobial 
Agents and Chemotherapy, 60(7): 4412. 

Reddy KVR, Yedery RD, Aranha C. 2004. Antimicrobial peptides: premises 
and promises. International Journal of Antimicrobial Agents, 24(6): 
536-547. 

Richard P, Floch RL, Chamoux C, Pannier M, Espaze E, Richt H. 1994. 
Pseudomonas aeruginosa outbreak in a burn unit: role of antimicrobials in 
the emergence of multiply resistant strains. The Journal of Infectious 
Diseases, 170(2): 377—383. 

Rinaldi AC. 2002. Antimicrobial peptides from amphibian skin: an expanding 
scenario: commentary. Current Opinion in Chemical Biology, 6(6): 799-804. 
Rodríguez-Vázquez N, Lionel Ozores H, Guerra A, González-Freire E, 
Fuertes A, Panciera M, Priegue JM, Outeiral J, Montenegro J, Garcia- 
Fandino R, Amorín M, Granja JR. 2014. Membrane-targeted self- 
assembling cyclic peptide nanotubes. Current Topics in Medicinal 
Chemistry, 14(23): 2647—2461. 

Rollins-Smith LA, Reinert LK, O'leary CJ, Houston LE, Woodhams DC. 
2005. Antimicrobial peptide defenses in amphibian skin. /ntegrative and 
Comparative Biology, 45(1): 137—142. 

Rose AS, Bradley AR, Valasatava Y, Duarte JM, Prlió A, Rose PW. 2018. 
NGL viewer: web-based molecular graphics for large complexes. 
Bioinformatics, 34(21): 3755-3758. 

Rozgonyi F, Szabo D, Kocsis B, Ostorhazi E, Abbadessa G, Cassone M, 
Wade JD, L. Otvos J. 2009. The antibacterial effect of a proline-rich 
antibacterial peptide A3-APO. Current Medicinal Chemistry, 16(30): 3996- 
4002. 

Sader HS, Fedler KA, Rennie RP, Stevens S, Jones RN. 2004. Omiganan 
pentahydrochloride (MBI 226), a topical 12-amino-acid cationic peptide: 
spectrum of antimicrobial activity and measurements of bactericidal activity. 
Antimicrobial Agents and Chemotherapy, 48(8): 3112-3118. 

Sarma P, Mahendiratta S, Prakash A, Medhi B. 2018. Specifically targeted 
antimicrobial peptides: a new and promising avenue in selective 
antimicrobial therapy. Indian Journal of Pharmacology, 50(1): 1—3. 
Schmidtchen A, Pasupuleti M, Malmsten M. 2014. Effect of hydrophobic 
modifications in antimicrobial peptides. Advances in Colloid and Interface 
Science, 205: 265-274. 

Shai Y. 2002. Mode of action of membrane active antimicrobial peptides. 
Peptide Science, 66(4): 236-248. 


Zoological Research 40(6): 488-505, 2019 503 


Sharma H, Nagaraj R. 2015. Human g-defensin 4 with non-native disulfide 
bridges exhibit antimicrobial activity. PLoS One, 10(3): e0119525. 
Sieprawska-Lupa M, Mydel P, Krawczyk K, Wójcik K, Puklo M, Lupa B, 
Suder P, Silberring J, Reed M, Pohl J, Shafer W, Mcaleese F, Foster T, 
Travis J, Potempa J. 2004. Degradation of human antimicrobial peptide LL- 
37 by Staphylococcus aureus-derived proteinases. Antimicrobial Agents 
and Chemotherapy, 48(12): 4673-4679. 

Sierra JM, Fusté E, Rabanal F, Vinuesa T, Viñas M. 2017. An overview of 
antimicrobial peptides and the latest advances in their development. Expert 
Opinion on Biological Therapy, 17(6): 663-676. 

Silva CCG, Silva SPM, Ribeiro S. 2018. Application of bacteriocins and 
protective cultures in dairy food preservation. Frontiers in Microbiology, 
9: 594. 

Silveira E, Freitas AR, Peixe L, Novais C. 2009. Environmental spread of 
antibiotic molecules, antibiotic resistant bacteria and genes: jigsaw pieces 
of a public health problem. Revista da Faculdade de Ciencias da Sande, 6: 
244-253. 

Starr CG, Wimley WC. 2017. Antimicrobial peptides are degraded by the 
cytosolic proteases of human erythrocytes. Biochimica et Biophysica Acta - 
Biomembranes, 1859(12): 2319-2326. 

Stec B. 2006. Plant thionins — the structural perspective. Cellular and 
Molecular Life Sciences CMLS, 63(12): 1370-1385. 

Stefania G, Annarita F, Rita B, Paolo G, Giancarlo M, Massimiliano G. 2015. 
Antimicrobial peptides as an opportunity against bacterial diseases. Current 
Medicinal Chemistry, 22(14): 1665-1677. 

Stewart PS. 2002. Mechanisms of antibiotic resistance in bacterial biofilms. 
International Journal of Medical Microbiology, 292(2): 107—113. 

Stotz HU, Thomson JG, Wang Y. 2009. Plant defensins: defense, 
development and application. Plant Signaling & Behavior, 4(11): 1010-1012. 
Struelens MJ. 1998. The epidemiology of antimicrobial resistance in 
hospital acquired infections: problems and possible solutions. British 
Medical Journal, 317(7159): 652-654. 

Tachi T, Epand RF, Epand RM, Matsuzaki K. 2002. Position-dependent 
hydrophobicity of the antimicrobial magainin peptide affects the mode of 
peptide - lipid interactions and selective toxicity. Biochemistry, 41(34): 
10723-10731. 

Tam JP, Wang SJ, Wong KH, Tan WL. 2015. Antimicrobial peptides from 
plants. Pharmaceuticals (Basel), 8(4): 711-757. 

Tavares LS, Santos MDO, Viccini LF, Moreira JS, Miller RNG, Franco OL. 
2008. Biotechnological potential of antimicrobial peptides from flowers. 
Peptides, 29(10): 1842-1851. 

Tincu JA, Taylor SW. 2004. Antimicrobial peptides from marine 
invertebrates. Antimicrobial Agents and Chemotherapy, 48(10): 3645- 
3654. 

Tong ZC, Ni LX, Ling JQ. 2014. Antibacterial peptide nisin: a potential role 
in the inhibition of oral pathogenic bacteria. Peptides, 60: 32—40. 

Treffers C, Chen L, Anderson RC, Yu PL. 2005. Isolation and 
characterisation of antimicrobial peptides from deer neutrophils. 
International Journal of Antimicrobial Agents, 26(2): 165-169. 

Tzong-Hsien L, Kristopher NH, Marie-Isabel A. 2016. Antimicrobial peptide 
structure and mechanism of action: a focus on the role of membrane 
structure. Current Topics in Medicinal Chemistry, 16(1): 25-39. 

Ulm H, Wilmes M, Shai Y, Sahl HG. 2012. Antimicrobial host defensins- 


specific antibiotic activities and innate defense modulation. Frontiers in 


504  www.zoores.ac.cn 


Immunology, 3: 249-249. 

van Boeckel TP, Brower C, Gilbert M, Grenfell BT, Levin SA, Robinson TP, 
Teillant A, Laxminarayan R. 2015. Global trends in antimicrobial use in food 
animals. Proceedings of the National Academy of Sciences of the United 
States of America, 112(18): 5649-5654. 

van der Weide H, Brunetti J, Pini A, Bracci L, Ambrosini C, Lupetti P, 
Paccagnini E, Gentile M, Bernini A, Niccolai N, Jongh DV, Bakker- 
Woudenberg IAJM , Goessens WHF, Hays JP, Falciani CS. 2017. 
Investigations into the killing activity of an antimicrobial peptide active 
against extensively antibiotic-resistant K. pneumoniae and P. aeruginosa. 
Biochimica et Biophysica Acta (BBA) - Biomembranes, 1859(10): 1796 - 
1804. 

van der Weide H, Vermeulen-De Jongh DMC, van der Meijden A, Boers SA, 
Kreft D, Ten Kate MT, Falciani C, Pini A, Strandh M, Bakker-Woudenberg 
IAJM , Hays JP, Goessens WHF. 2019. Antimicrobial activity of two novel 
antimicrobial peptides AA139 and SET-M33 against clinically and 
genotypically diverse Klebsiella pneumoniae isolates with differing antibiotic 
resistance profiles. International Journal of Antimicrobial Agents, 54(2): 
159-166. 

van't Hof W, Veerman ECI, Helmerhorst EJ, Amerongen AVN. 2001. 
Antimicrobial peptides: Properties and applicability. Biological Chemistry, 
382(4): 597—619. 

Wakabayashi H, Hiratani T, Uchida K, Yamaguchi H. 1996. Antifungal 
spectrum and fungicidal mechanism of an N-terminal peptide of bovine 
lactoferrin. Journal of Infection and Chemotherapy, 1(3): 185-189. 

Wang GS, Li X, Wang Z. 2016. APD3: the antimicrobial peptide database 
as a tool for research and education. Nucleic Acids Research, 44(D1): 
D1087-D1093. 

Wang GX, Wang Y, Ma DY, Liu H, Li JX, Zhang KY, Yang XL, Lai R, Liu JZ. 
2013. Five novel antimicrobial peptides from the kuhl' s wart frog skin 
secretions, Limnonectes kuhlii. Molecular Biology Reports, 40(2): 1097 — 
1102. 

Wang YP, Hong J, Liu XH, Yang HL, Liu R, Wu J, Wang A, Lin DH, Lai R. 
2008. Snake cathelicidin from Bungarus fasciatus is a potent peptide 
antibiotics. PLoS One, 3(9): e3217-e3217. 

Wang YP, Zhang ZY, Chen LL, Guang HJ, Li Z, Yang HL, Li JX, You DW, Yu 
HN, Lai R. 2011. Cathelicidin-BF, a snake cathelicidin-derived antimicrobial 
peptide, could be an excellent therapeutic agent for acne vulgaris. PLoS 
One, 6(7): €22120-e22120. 

Wang ZL, Wang XM, Wang JH. 2018. Recent Advances in antibacterial and 
antiendotoxic peptides or proteins from marine resources. Marine Drugs, 
16(2): 57. 

WHO. 2014. Antimicrobial resistance: Global report on surveillance. https:// 
www.who.int/drugresistance/documents/AMR report Web slide set.pdf. 
Wimley WC. 2010. Describing the mechanism of antimicrobial peptide 
action with the interfacial activity model. ACS Chemical Biology, 5(10): 
905-917. 

Woodhams DC, Ardipradja K, Alford RA, Marantelli G, Reinert LK, Rollins- 
Smith LA. 2007. Resistance to chytridiomycosis varies among amphibian 
species and is correlated with skin peptide defenses. Animal Conservation, 
10(4): 409-417. 

Wu GQ, Li XF, Fan XB, Wu HB, Wang SL, Shen ZL, Xi T. 2011. The activity 
of antimicrobial peptide S-thanatin is independent on multidrug-resistant 
spectrum of bacteria. Peptides, 32(6): 1139-1145. 


Wu GQ, Wu PP, Xue XL, Yan XJ, Liu S, Zhang C, Shen ZL, Xi T. 2013. 
Application of S-thanatin, an antimicrobial peptide derived from thanatin, in 
mouse model of Klebsiella pneumoniae infection. Peptides, 45: 73-77. 

Wu QH, Patočka J, Kuča K. 2018. Insect antimicrobial peptides, a mini 
review. Toxins (Basel), 10(11): 461. 

Xhindoli D, Pacor S, Benincasa M, Scocchi M, Gennaro R, Tossi A. 2016. 
The human cathelicidin LL-37 — a pore-forming antibacterial peptide and 
host-cell modulator. Biochimica et Biophysica Acta (BBA) - Biomembranes, 
1858(3): 546-—566. 

Xiao Y, Liu CB, Lai R. 2011. Antimicrobial peptides from amphibians. 
BioMolecular Concepts, 2(1-2): 27-38. 

Yamaguchi Y, Ouchi Y. 2012. Antimicrobial peptide defensin: identification of 
novel isoforms and the characterization of their physiological roles and their 
significance in the pathogenesis of diseases. Proceedings of the Japan 
Academy. Series B, Physical and Biological Sciences, 88(4): 152—166. 
Yamamoto Y, Kawahara R, Fujiya Y, Sasaki T, Hirai |, Khong DT, Nguyen 
TN, Nguyen BX. 2018. Wide dissemination of colistin-resistant Escherichia 
coli with the mobile resistance gene mcr in healthy residents in Vietnam. 
Journal of Antimicrobial Chemotherapy, 74(2): 523-524. 

Yamauchi K, Tomita M, Giehl TJ, Ellison RT, 3rd. 1993. Antibacterial activity 
of lactoferrin and a pepsin-derived lactoferrin peptide fragment. Infection 
and Immunity, 61(2): 719-728. 

Yang SC, Lin CH, Sung CT, Fang JY. 2014. Antibacterial activities of 
bacteriocins: application in foods and pharmaceuticals. Frontiers in 
Microbiology, 5: 241. 

Yang ST, Yub Shin S, Kim YC, Kim Y, Hahm KS, Kim Jl. 2002. 
Conformation-dependent antibiotic activity of tritrpticin, a cathelicidin- 
derived antimicrobial peptide. Biochemical and Biophysical Research 
Communications, 296(5): 1044—1050. 

Yeaman MR, Yount NY. 2003. Mechanisms of antimicrobial peptide action 
and resistance. Pharmacological Reviews, 55(1): 27—55. 

Yi HY, Chowdhury M, Huang YD, Yu XQ. 2014. Insect antimicrobial 
peptides and their applications. Applied Microbiology and Biotechnology, 
98(13): 5807—5822. 

Yüksel E, Karakeçili A, Demirtaş TT, Gümüsderelioslu M. 2016. Preparation 
of bioactive and antimicrobial PLGA membranes by magainin Il//EGF 
functionalization. International Journal of Biological Macromolecules, 86: 
162-168. 

Zaman SB, Hussain MA, Nye R, Mehta V, Mamun KT, Hossain N. 2017. A 
review on antibiotic resistance: alarm bells are ringing. Cureus, 9(6): e1403. 
Zasloff M. 1987. Magainins, a class of antimicrobial peptides from xenopus 


skin: isolation, characterization of two active forms, and partial cDNA 
sequence of a precursor. Proceedings of the National Academy of Sciences 
of the United States of America, 84(15): 5449-5453. 

Zerweck J, Strandberg E, Kukharenko O, Reichert J, Bürck J, Wadhwani P, 
Ulrich AS. 2017. Molecular mechanism of synergy between the 
antimicrobial peptides PGLa and magainin 2. Scientific Reports, 7(1): 
13153. 

Zhang F, Guo ZL, Chen Y, Li L, Yu HN, Wang YP. 2019. Effects of C- 
terminal amidation and heptapeptide ring on the biological activities and 
advanced structure of amurin-9KY, a novel antimicrobial peptide identified 
from the brown frog, Rana kunyuensis. Zoological Research, 40(3): 
198-204. 

Zhang LJ, Gallo RL. 2016. Antimicrobial peptides. Current Biology, 26(1): 
R14-R19. 

Zhang Y. 2006. Amphibian skin secretions and bio-adaptive significance — 
implications from Bombina maxima skin secretion proteome. Zoological 
Research, 27(1): 101-112. 

Zhang Y. 2015. Why do we study animal toxins?. Zoological Research, 
36(4): 183-22. 

Zhang YX, Zou AH, Manchu RG,Zhou YC,Wang SF. 2008. Purification and 
antimicrobial activity of antimicrobial protein from brown-spotted grouper, 
Epinephelus fario. Zoological Research, 29(6): 627—632. 

Zhang ZY, Meng P, Han Y, Shen CB, Li BW, Hakim MA, Zhang XG, Lu QM, 
Rong MQ, Lai R. 2015. Mitochondrial DNA-LL-37 complex promotes 
atherosclerosis by escaping from autophagic recognition. Immunity, 43(6): 
1137-1147. 

Zhao CX, Dwyer MD, YuAL, Wu Y, Fang S, Middelberg APJ. 2015. A simple 
and low-cost platform technology for producing pexiganan antimicrobial 
peptide in E. coli. Biotechnology and Bioengineering, 112(5): 957-964. 

Zhao F, Lan XQ, Du Y, Chen PY, Zhao J, Zhao F, Lee WH, Zhang Y. 2018. 
King cobra peptide OH-CATH30 as a potential candidate drug through clinic 
drug-resistant isolates. Zoological Research, 39(2): 87—96. 

Zhao XW, Wu HY, Lu HR, Li GD, Huang QS. 2013. LAMP: A database 
linking antimicrobial peptides. PLoS One, 8(6): e66557. 

Zhao YY, Zhang M, Qiu S, Wang JY, Peng JX, Zhao P, Zhu RR, Wang HL, 
Li Y, Wang KR, Yan WJ, Wang R. 2016. Antimicrobial activity and stability of 
the D-amino acid substituted derivatives of antimicrobial peptide polybia- 
MPI. AMB Express, 6(1): 122. 

Zhong J, Wang WH, Yang XM, Yan XW, Liu R. 2013. A novel cysteine-rich 
antimicrobial peptide from the mucus of the snail of Achatina fulica. 
Peptides, 39: 1—5. 


Zoological Research 40(6): 488-505, 2019 505 


ZOOLOGICAL RESEARCH 


Chromosomal level assembly and population 
sequencing of the Chinese tree shrew genome 


Yu Fan'’, Mao-Sen Yet’, Jin-Yan Zhang'?, Ling Xu!2, Dan-Dan Yu'?, Tian-Le Gut’, Yu-Lin Yao'?, Jia-Qi Chen*, Long-Bao 
Lv^, Ping Zheng??^**"7, Dong-Dong Wu?*, Guo-Jie Zhang”, Yong-Gang Yao'***5" 


! Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province, Kunming 
Institute of Zoology, Chinese Academy of Sciences, Kunming Yunnan 650223, China 

? Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming Yunnan 650223, China 

3 Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming Yunnan 650204, China 

^ Kunming Primate Research Center of the Chinese Academy of Sciences, Kunming Institute of Zoology, Chinese Academy of Sciences, 


Kunming Yunnan 650223, China 


5 KIZ—CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese 


Academy of Sciences, Kunming Yunnan 650223, China 


$ State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 


Yunnan 650223, China 


7 Yunnan Key Laboratory of Animal Reproduction, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming Yunnan 650223, 


China 


ABSTRACT 


Chinese tree shrews (Tupaia belangeri chinensis) 
have become an increasingly important experimental 
animal in biomedical research due to their close 
relationship to primates. An accurately sequenced 
and assembled genome is essential for 
understanding the genetic features and biology of 
this animal. In this study, we used long-read single- 
molecule sequencing and high-throughput 
chromosome conformation capture (Hi-C) 
technology to obtain a high-quality chromosome- 
scale scaffolding of the Chinese tree shrew genome. 
The new reference genome (KIZ version 2: TS. 2.0) 
resolved problems in presently available tree shrew 
genomes and enabled accurate identification of 
large and complex repeat regions, gene structures, 
and species-specific genomic structural variants. In 


addition, by sequencing the genomes of six Chinese 
tree shrew individuals, we produced a 
comprehensive map of 12.8 M single nucleotide 


polymorphisms and confirmed that the major 
histocompatibility complex (MHC) loci and 
immunoglobulin gene family exhibited high 


nucleotide diversity in the tree shrew genome. We 
updated the tree shrew genome database 
(TreeshrewDB v2.0: http://www. treeshrewdb. org) to 
include the genome annotation information and 
genetic variations. The new high-quality reference 
genome of the Chinese tree shrew and the updated 
TreeshrewDB will facilitate the use of this animal in 
many different fields of research. 


Keywords: Tupaia belangeri, Chromosomal level 
assembly genome; Population sequencing; Database 
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INTRODUCTION 


Tree shrews (Tupaia belangeri) are widely distributed 
throughout South Asia, Southeast Asia (Fuchs & Corbach- 
Sóhle, 2010), and South and Southwest China (Peng et al., 
1991). They possesses many unique characteristics that are 
useful in biomedical research models, such as small adult 
body size (100—150 g), easy and low cost maintenance, short 
reproductive cycle (76 weeks), moderate life span (6-8 years), 
high brain-to-body mass ratio, and very close relationship to 
primates (Fan et al., 2013; Xiao et al., 2017; Xu et al., 2012; 
Yao, 2017; Zheng et al., 2014). Hitherto, tree shrews have 
been used in a wide variety of studies, including research on 
viral infection (Amako et al., 2010; Guo et al., 2018; Kock et 
al., 2001; Li et al., 2014; Li et al., 2016; Yang et al., 2013), 
visual cortex function (Bosking et al., 2002; Lee et al., 2016; 
MacEvoy et al., 2009; Mooser et al., 2004; Veit et al., 2014), 
brain development and aging (Fan et al., 2018; Wei et al., 
2017), and neuropsychiatric disorders induced by social 
stress (Fuchs, 2005; Meyer et al., 2001). Previously, we 
successfully sequenced the genome of the Chinese tree 
shrew (Tupaia belangeri chinensis) using Illumina short-read 
sequencing (KIZ version 1: TS_1.0) and showed their close 
relationship to non-human primates, thereby settling a long- 
running debate regarding the phylogenetic position of tree 
shrews within eutherian mammals (Fan et al., 2013). 
Furthermore, to advance the use of the tree shrew genome, 
we developed a user-friendly tree shrew database 
(TreeshrewDB: www. treeshrewdb. org) (Fan et al., 2014). The 
successful genome sequencing (Fan et al., 2013) and genetic 
manipulation of tree shrews (Li et al., 2017) have opened up 
new avenues for the wide usage of this species in biomedical 
research (Yao, 2017). 

Accurate genome sequencing and assembly are essential 
for understanding phylogenetic relationships and genome and 
phenome evolution (Kronenberg et al., 2018). Despite the fact 
that short-read sequencing technologies remain the most 
popular methods used to generate high-throughput data at 
relatively low cost (Schatz et al., 2010), whole-genome 
assembly of mammalian genomes based on these older 
sequencing technologies contains many problems, including 
assembly gaps and incomplete gene models (Sohn & Nam, 
2018). For instance, approximately 50% of the human 
genome comprises non-random repeat elements (Cordaux & 
Batzer, 2009) and a complex sequence structure, which is a 
major challenge in reference genome assembly (Phillippy et 
al., 2008). Although our earlier Chinese tree shrew genome 
(KIZ version 1: TS_1.0) produced in 2013 (Fan et al., 2013) 
had high sequencing coverage (79x), the assembled genome 
still contained 223 607 gaps (including 65 222 gaps in the 
genic region), and thus did not fully meet research needs. 
Single-molecule sequencing technology can generate reads 
tens of kilobases in size and can span most repeat 
sequences, which allows for complete reference genome 
assembly (Bickhart et al., 2017; Chaisson et al., 2015). High- 
throughput chromosome conformation capture — (Hi-C) 
technology can be used to study the three-dimensional 


architecture of genomes and can order, orient, and anchor 
contigs into chromosome-scale scaffolds (Burton et al., 2013). 
Here, we applied both long-read single-molecule sequencing 
and Hi-C technology to obtain a new reference genome for 
the Chinese tree shrew. We also generated a single 
nucleotide polymorphism (SNP) map of the tree shrew by 
whole-genome sequencing of six individuals. We updated the 
TreeshrewDB v2.0 (http://www. treeshrewdb. org) to 
incorporate the new reference genome and population genetic 
variations. 


MATERIALS AND METHODS 


Tissue samples and genome sequencing 

A male Chinese tree shrew from the Experimental Animal 
Center of the Kunming Institute of Zoology, Chinese Academy 
of Sciences, was used for single-molecule, real-time (SMRT) 
long-read sequencing (PacBio) and Hi-C sequencing. Ear 
tissues of six Chinese tree shrews were used for whole- 
genome sequencing using Illumina HiSeq X Ten (USA). This 
study was approved by the Institutional Review Board of the 
Kunming Institute of Zoology, Chinese Academy of Sciences 
(KIZ-SYDW-20101015-001 and KIZ-SMKX-20160315-001). 

We generated long-insert (20—40 kb) genomic libraries 
based on standard SMRT sequencing protocols developed by 
Pacific Biosciences (PacBio). The libraries were sequenced 
using the PacBio RS Il instrument with the P6-C4 sequencing 
reagent. Brain tissue from the same individual was used to 
construct the Hi-C libraries. Briefly, minced brain tissue was 
fixed in 296 formaldehyde for 10 min and then lysed in 2.5 mol/L 
glycine. Cross-linked genomic DNA was digested with Mbol 
(#B7024, New England Biolabs, UK). Sticky ends were filled 
with nucleotides, one of which was biotinylated. Ligation was 
performed under extremely dilute conditions favoring 
intramolecular ligation events: the Mbol site was lost and a 
Nhel (#R3131, New England Biolabs, UK) site was created. 
DNA was purified and sheared, and biotinylated junctions 
were isolated using streptavidin beads. Interacting fragments 
were sequenced by Illumina HiSeq X Ten (USA). 

For whole-genome sequencing of the six tree shrew 
individuals, short-insert read (300 bp) genomic libraries were 
constructed using the Illumina TruSeq Nano DNA Library Prep 
Kits (USA) and sequenced using the Illumina HiSeq X Ten 
(USA). 


Genome assembly and quality evaluation 

We applied Canu (Koren et al., 2017) to correct the SMRT 
reads, then used smartdenovo (https://github. com/ruanjue/ 
smartdenovo) to perform de novo assembly. The assembly 
was error-corrected using Quiver (Chin et al., 2013) and Pilon 
(Walker et al., 2014) based on alignment of 30-fold Illumina 
paired-end reads. The Hi-C sequencing reads were aligned to 
the assembled contigs using the bowtie2 end-to-end algorithm 
(Langmead & Salzberg, 2012). We used Lachesis (Burton et 
al., 2013) to cluster, order, and direct the assembled contigs 
onto 31 pseudo-chromosomes (TS_2.0 assembly), which was 
arbitrarily defined based on the number of haploid 
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chromosomes of the tree shrew (Liu et al., 1989). A total of 
4 104 benchmarking universal single-copy orthologs in the 
mammalian dataset of the Benchmarking with Universal 
Single-Copy Orthologs (BUSCO) (Simao et al., 2015) were 
mapped to the assembled contigs using tBlastn (Altschul et 
al., 1997) to assess overall assembly quality. We also used 
the whole-genome sequencing data of the male Chinese tree 
shrew to assess the quality of the TS_2.0 assembly. In brief, 
we aligned the reads to the TS 2.0 assembly and previous 
TS_1.0 assembly (Fan et al., 2013) using BWA (Li & Durbin, 
2009). We called genetic variants (SNPs and indels 
(insertions and deletions)) using FreeBayes (Garrison & 
Marth, 2012) and the structural variants (SVs) using Lumpy- 
SV (Layer et al., 2014), respectively. The feature response 
curve (FRC) (Vezzi et al., 2012) was estimated based on the 
aligned reads. The quality value (QV) was calculated as 
described previously (Bickhart et al., 2017): 


QV -—tes (2) (1) 


where, S indicates the cumulative length of all SNPs and 
indels identified using FreeBayes (Garrison & Marth, 2012) 
that had a probability of being heterozygous greater than 0.5, 
and B indicates the number of base pairs in the assembly that 
had at least 3x sequencing coverage. 


Annotation of repeats in genome 

We employed Tandem Repeats Finder v4.09 (Benson, 1999) 
to annotate the tandem repeats in the TS 2.0 assembly. The 
transposable elements (TEs) were identified based on a 
combination of de novo and homology-based predictions, as 
described in our previous study (Fan et al., 2013). Briefly, the 
RepeatModeler (Chen, 2004) was used to construct a de novo 
repeat library. We used RepeatMasker and 
RepeatProteinMask (Chen, 2004) to identify different types of 
TEs by aligning the TS 2.0 assembly with the known 
RepBase library (Chen, 2004) and the constructed de novo 
repeat library. 


Gene prediction and annotation 

A total of 13 RNA-seq datasets from our previous studies (Fan 
et al., 2013, 2018) were cleaned using Trimmomatic (Bolger et 
al., 2014), then aligned to the TS 2.0 assembly using Tophat2 
(Kim et al., 2013). The cleaned reads were also de novo 
assembled using Trinity (Grabherr et al., 2011). The above 
RNA-seq assemblies were further combined using PASA 
(Haas et al., 2008). 

For homology-based gene prediction, we downloaded 
protein sequences of humans (Homo sapiens), chimpanzees 
(Pan troglodytes), macaques (Macaca mulatta) and mice (Mus 
musculus) from Ensembl (release 7 1;https://asia.ensembl.org/ 
index. html), which display more accurate annotation of gene 
models. These protein sequences were mapped to the TS 2.0 
assembly using TblastN (Altschul et al., 1997). GeneWise 
(Birney et al., 2004) was used to define gene models. For ab 
initio gene prediction, Augustus (Stanke & Waack, 2003), 
Genescan (Salamov & Solovyev, 2000), SNAP (Korf, 2004), 
and GeneMark (Besemer & Borodovsky, 2005) were used to 
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predict coding genes. 

We employed EVidenceModeler (Haas et al. 2008) to 
combine the RNA-seq, cDNA, and protein alignments with 
different weights (RNA-seq»cDNA/protein»ab initio gene 
predictions) to achieve a comprehensive and non-redundant 
reference gene set. This gene set was further updated using 
PASA (Haas et al., 2008), followed by annotation based on the 
best matches derived from the protein sequence alignments 
described in the SwissProt and TrEMBL databases 
(O'Donovan et al, 2002) using Blastp (with default 
parameters) (Altschul et al., 1997). We annotated motifs and 
domains of proteins using InterPro (Mulder & Apweiler, 2007) 
to search publicly available databases, including Pfam (http:// 
pfam. sanger. ac. uk/), PRINTS (http://www. bioinf. manchester. 
ac.uk/dbbrowser/PRINTS/index. php), PROSITE (http://prosite. 
expasy. org/), ProDom (http://prodom. prabi. fr/prodom/current/ 
html/home. php), and SMART (http://smart. embl-heidelberg. 
de/). Descriptions of gene products, such as Gene Ontology 
(Ashburner et al., 2000) information, were retrieved from 
InterPro (Mulder & Apweiler, 2007). Pathway information was 
obtained by blasting the above reference gene set with the 
KEGG database (Kanehisa & Goto, 2000), with the best hit for 
each gene used for the annotation. 


Gene synteny map among different species 

We used the human (hg38; https://www. ncbi. nim. nih. gov/grc/ 
human), macaque (rheMac3 (Yan et al., 2011)), and mouse 
(GRCm38; https://www. ncbi. nim. nih. gov/grc/mouse) genomes 
and TS 2.0 to build a gene synteny map, as described 
previously (Fan et al., 2013). Briefly, the gene synteny map 
was constructed on the basis of orthologous genes. We did 
not use the whole genome alignment due to great sequence 
diversity among the species. The longest human, macaque, 
tree shrew, and mouse transcripts were chosen to represent 
each gene with alternative splicing variants. All protein 
sequences from the four species were aligned against the 
same protein set using BlastP with a similarity cutoff threshold 
of e-value=1x10°. With the human protein set as a reference, 
we found the best hit for each protein in the other species, 
with a criterion that more than 30% of the aligned sequence 
showed identity above 3096. Reciprocal best-match pairs were 
defined as orthologs. Orthologs not in the gene synteny 
blocks were removed from further analysis. For example, for 
three continuous genes (A, B, and C) in the human genome, if 
all three orthologs could be identified between humans and 
tree shrews based on the cutoff threshold described above, 
and the B gene in the tree shrew genome was not between 
genes A and C, or located in other scaffolds or other places 
within the same scaffold, then the B gene was removed. Using 
this method, we identified four-way gene synteny relationships 
for humans, macaques, tree shrews, and mice. The gene 
order information of the human genome was used to identify 
the macaque, tree shrew, and mouse genomic SVs. 


Whole genome sequencing and SNP calling 
Low-quality raw short reads were removed using Trimmomatic 
v0.32 (Bolger et al., 2014) with the parameters "LEADING: 3 


TRAILING: 3 SLIDINGWINDOW: 4: 15 MINLEN: 36". Quality- 
filtered reads were aligned to the reference TS 2.0 assembly 
using BWA-MEM (Li & Durbin, 2009). Picard Tools (http:// 
broadinstitute.github.io/picard/) were used to flag duplicate 
reads. Only non-duplicate reads were used for subsequent 
analyses. GenomeAnalysisTK-3.7 (GATK) (McKenna et al., 
2010) was used to realign indels and recalibrate base quality. 
We retained all single-nucleotide variants (SNVs) called by 
GATK UG with a Phred-quality score>Q10. The SNVs were 
hard filtered with the parameters "DP>8 & QD>5.0 & HRun«5 
& SB«0.00 & QUAL>50 & FS«600 & MQ>40.0 & 
HaplotypeScore>13.0". ANNOVAR was used to classify 
variants into different functional categories according to their 
locations and expected effects on encoded gene products 
(Wang et al., 2010). 


Population genetic analyses 

Nucleotide diversity (m) and Tajima's D value (Tajima, 1989) 
were estimated using VCFtools based on the six wild Chinese 
tree shrews, with a sliding window of 100 kb in each genome. 
For each coding gene, we estimated the population genetic 
parameters, including ~, Watterson theta estimate (0w) 
(Watterson, 1975), Tajima's D (Tajima, 1989), Fu and Li's D 
(Fu & Li, 1993), Fu and Li's F (Fu & Li, 1993), and Fay and 
Wu's H (Fay & Wu, 2000), using in-house perl scripts. 
Manhattan plot analysis was performed using the R package 
qqman (https://cran. r-project. org/web/packages/qqman/index. 
html). 


mRNA expression analysis 
The raw RNA-seq data (Supplementary Table S1) were 
trimmed to remove sequencing adapters and reads containing 
one or more Ns>5% or of low quality (more than 20% of the 
base's qualities were less than 10). The filtered reads were 
aligned to the reference genome TS 2.0 assembly using 
HISAT2 (Kim et al., 2015). The HTSeq-count (Anders et al., 
2015) was used to count aligned reads mapped with the 
above reference gene set. We calculated the FPKM 
(fragments per kilobase per million mapped reads) value 
using in-house perl script to quantify mRNA expression as 
follows: 

10°C 

NL/10° e 
where, FPKM refers to the mRNA expression level of gene A, 
C is the number of fragments uniquely aligned to gene A, N is 
the total number of reads uniquely aligned to all genes, and L 
is the base number in the coding region of gene A. 

For co-expression analysis, we used reported RNA-seq 
data from seven tree shrew brain tissues (Fan et al., 2013, 
2018) to calculate Pearson's correlation coefficients for each 
gene pair. A co-expression gene pair was defined by a 
Pearson's correlation coefficient cut-off of 0.8. 


FPKM = 





Tree shrew database 

Our developed tree shrew database (TreeshrewDB v2.0) runs 
on a dual-processor server with an Ubuntu operating system 
and is implemented under the LAMP (Linux-Apache-MySQL- 


Perl) software stack. The Chinese tree shrew genome TS 2.0 
assembly, gene set, gene annotation, and other information 
are stored in the MySQL, and are administrated with the help 
of phpMyAdmin. The web interfaces were developed using 
various computer languages such as HTML, CSS, JavaScript, 
and Perl. 


RESULTS 


Assembly of reference genome and quality evaluation 

We generated ~55x (148.58 Gb) whole-genome sequence 
coverage for the sampled adult male Chinese tree shrew 
using SMRT long-read sequencing technology (PacBio). After 
filtering poor-quality reads, we programmed a combination 
method of de novo assembly to generate a high-quality tree 
shrew genome (KIZ version 2: TS. 2.0, with a size of 2.67 Gb). 
The new assembly produced a total of 3 344 sequence 
contigs, with a 112-fold reduction in the number of contigs 
compared to that of our previous assembly (KIZ version 1: TS _ 
1.0) based on short reads (Fan et al., 2013) (Table 1). The 
contig N50 of TS 2.0 was 3.2 Mb and exhibited remarkable 
improvement (146-fold) compared with that of the previous 
assembly (TS 1.0) (Fan et al., 2013). Nearly 6096 of contigs 
(1 963/3 344) were longer than 100 kb and accounted for 
97.4% of the assembled genome. The longest contig was 
16.2 Mb (Table 1; Figure 1). We used BUSCO analysis, which 
is a powerful tool for assessing genome assembly and 
annotation completeness with single-copy orthologs (Simao et 
al., 2015), to evaluate the quality of the TS. 2.0 contigs. About 
91.4% of the 4 104 core genes in the mammalian dataset 
were complete BUSCO genes (Table 2). These tests all 
showed that the newly assembled tree shrew genome contigs 
had superior quality to those in recently reported ape 
genomes (Kronenberg et al., 2018). 

We also generated ~264 x (705 Gb) Hi-C data (Table 3) to 
cluster the contigs into chromosome-scale scaffolds. A total of 
1728 contigs (comprising 96.296 of the assembled genome 
sequence) were anchored into 31 pseudo-molecules, whereas 
1616 contigs (102 Mb, 3.896 of assembled genome 
sequence) were unanchored (Table 4). The final chromosome- 
scale scaffolding of the de novo genome assembly of the 
Chinese tree shrew (TS 2.0) had a scaffold N50 length of 104 
Mb, which is much more complete than the previous TS 1.0 
assembly (Fan et al., 2013) (Table 1). 

To compare the long-read tree shrew genome assembly 
(TS 2.0) in this study with the previous short-read assembly 
(TS 1.0) (Fan et al., 2013), we generated ~30x coverage 
Illumina paired-end read sequences from another tree shrew 
and aligned it to both assemblies. The identified SNPs and 
indels were used to estimate assembly accuracy. The TS 2.0 
assembly (quality value=28.56) had a higher quality value, as 
estimated using the number of non-matching base calls from 
FreeBayes (Bickhart et al., 2017; Garrison & Marth, 2012), 
than that of the TS 1.0 assembly (quality value=26.75) (Table 
5). In addition, the TS 2.0 assembly had 3-fold fewer SVs 
than that of TS 1.0 (Fan et al., 2013), thus suggesting fewer 
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Table 1 Comparison of Chinese tree shrew assembly quality between assemblies TS 1.0 and TS 2.0 





Version Item Contig length (bp) Scaffold length (bp) Contig No. Scaffold No. 
Total 2 719 442 484 2 861 790 358 374 120 150 513 
Max length 187 505 19 269 909 = = 
>2 000 bp = = 180 802 4 525 
Short-read assembly >100 kb = = 305 1418 
(KIZ version 1:TS_1.0) N50 22 000 3 655 608 36 335 234 
N60 17 500 3 042 664 50 199 319 
N70 13 431 2 302 651 67 915 427 
N80 9 571 1 648 848 91 810 573 
Total 2 667 337 536 2 667 507 236 3 344 1 647 
Max_length 16 177 999 224 450 918 = = 
>2 000 bp = = 3 344 1647 
Long-read assembly >100 kb E = 1963 281 
(KIZ version 2:TS_2.0) N50 3 217 288 104 643 080 229 10 
N60 2 462 062 94 037 081 323 13 
N70 1 641 093 71 760 103 457 16 
N80 995 871 57 328 337 664 20 


Contig: Contiguous length of genomic sequence in which the order of bases has a high confidence level. Gaps occur where reads from two 
sequenced ends of at least one fragment overlap with other reads in two different contigs. Scaffolds are composed of contigs and gaps. N50: N50 
statistic defines assembly quality in terms of contiguity. Given a set of contigs ranked by contig size, N50 is defined as the size of the shortest 
contig, which adds contigs with larger size to reach 5096 of total genome length. —: Not available. 


Table 2 Assessment of assembly completeness in Chinese tree 
shrew using BUSCO 





Parameter No. Percentage (96) 
Complete genes 3 749 91.40 
Complete and single-copy genes 3696 90.10 
Complete and duplicated genes 53 1.30 
Fragmented genes 184 4.50 

Missing genes 171 4.10 

Total genes 4 104 x: 


BUSCO: Benchmarking with Universal Single-Copy Orthologs. A total 
of 4104 benchmarking universal single-copy orthologs of the 
mammalian dataset were retrieved from BUSCO (Simao et al., 2015). 
These genes were mapped to the TS 2.0 assembly using tBlastn 
(Altschul et al., 1997). — Not available. 


assembly errors (Table 5). Quality evaluation using the FRC 
method (Vezzi et al., 2012) also showed TS. 2.0 to be a better 
assembly (Table 5). 

About 73% of the gaps (163 220 gaps, 93.10 Mb) in the TS _ 
1.0 assembly (Fan et al., 2013) were filled by the 49.15 Mb 
long-read sequences in the TS 2.0 assembly. Among these 
fully closed gaps, 65 222 were located in the genic regions. 
Only 39 gaps in TS 2.0 were fully closed by TS 1.0 (Table 6). 
We note that 4 112 gaps in TS 1.0 had flanking sequences 
that were mapped to separate pseudo-chromosomes in TS . 
2.0, indicating assembly errors in TS. 1.0 (Fan et al., 2013). 

The updated genome assembly is available at 
TreeshrewDB v2.0 (http://www. treeshrewdb. org) and has 
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Table 3 Statistics of Hi-C data for mapping 


Parameter Hi-C data 
Clean data 705 Gb 

Clean paired-end reads 2 351 150 069 
Unmapped paired-end reads 47 514 976 
Unmapped paired-end reads rate (%) 2.02 
Paired-end reads with singleton 303 352 692 
Paired-end reads with singleton rate (%) 12.9 

Multi mapped paired-end reads 443 734 174 
Multi mapped ratio (%) 18.87 

Unique mapped paired-end reads 1 556 548 227 
Unique mapped ratio (%) 66.2 


been deposited in GSA (accession No. PRJCA001472; http:// 
gsa.big.ac.cn/) (Wang et al., 2017). 


Repeat content in tree shrew genome 

Repeat content in a genome poses a daunting difficulty for 
sequence assembly (Kronenberg et al., 2018). The function of 
repeat content has also begun to be recognized (Chuong et 
al., 2017). The TS_2.0 assembly presented an opportunity to 
identify and study full-length repeats. Here, 49.14% (up to 
1.31 Gb) of the TS 2.0 assembly was identified as 
interspersed repeats, which represents an increase of 308 Mb 
repeat elements relative to TS. 1.0 (Fan et al., 2013) (Table 7). 
Among the defined repeat elements, LINE1 (L1, long 
interspersed nuclear elements 1) repeats accounted for the 
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Figure 1 Assembly, annotation, and nucleotide diversity of Chinese tree shrew genome 


A: Contig length distribution of long-read assembly (TS 2.0) in comparison with short-read assembly (TS 1.0) (Fan et al., 2013). B: Circos plot 
showing genome-wide distribution profiles of genes, SNPs, and indels across Chinese tree shrew genome, and values of population genetic 
parameters (rr and Tajima' s D). C: Manhattan plot of nucleotide diversity (rr) at gene level based on SNPs located in coding regions of six wild tree 


shrews. Top 30 genes are shown in plot, with a cut-off rr value of 0.025. 
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Table 4 Pseudo-chromosome sizes and assignment of Hi-C 
scaffolds 


Length (bp) of pseudo- 





Pseudo-chromosome Contig No. chromosome 
chr1 154 224 402 198 
chr2 107 187 971 973 
chr3 111 137 178 494 
chr4 61 121 533 334 
chr5 74 120 860 892 
chr6 88 117 379 583 
chr7 67 108 205 678 
chr8 56 108 052 698 
chr9 62 104 638 498 
chr10 64 101 327 006 
chr11 71 97 509 983 
chr12 49 94 027 333 
chr13 54 92 296 458 
chr14 58 89 547 586 
chr15 69 71 741 294 
chr16 42 69 742 744 
chr17 48 66 945 814 
chr18 41 63 456 188 
chr19 27 57 308 528 
chr20 32 54 551 840 
chr21 35 49 758 179 
chr22 53 52 049 165 
chr23 27 43 809 476 
chr24 23 42 251 409 
chr25 33 41 996 642 
chr26 100 30 565 635 
chr27 50 25 814 610 
chr28 34 26 506 761 
chr29 27 22 607 893 
chr30 28 21 670 314 
chrX 452 118 492 391 
Total anchored 2 197 2 564 200 597 
Unanchored 1616 102 561 400 


We used Lachesis (Burton et al., 2013) to cluster, order, and direct the 
assembled contigs onto 31 pseudo-chromosomes, which were defined 
according to number of haploid chromosomes of the tree shrew (Liu et 
al., 1989). Contig No.: Number of contigs assembled onto each 
chromosome by Hi-C. Total anchored: total number of contigs that 
could be anchored into 31 pseudo-chromosomes. Unanchored: total 
number of contigs that could not be anchored into 31 pseudo- 
chromosomes. 
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Table 5 Assembly quality score value statistics 





Parameter Long-read assem- Short-read assem- 
bly (TS_2.0) bly (TS_1.0) 
Quality value 28.56 26.75 
Translocation 2 824 6 034 
Deletion 3 733 12 607 
Duplication 142 438 
Inversion 80 99 
Errors Per 100 Mbp 253.89 718.27 
HIGH_COV_PE 12 016 66 655 
HIGH_NORM_COV_PE 12415 69 902 
HIGH_OUTIE_PE 137 1 594 
HIGH_SINGLE_PE 10 151 
HIGH_SPAN_PE 1 237 32 751 
LOW NORM COV PE 536 72 38 
STRECH PE 31 741 66 763 
COMPR PE 13 818 20 437 


Quality value was estimated based on number of non-matching base 
calls from FreeBayes (Garrison & Marth, 2012). Errors per 100 Mbp 
were calculated as a sum ratio of Lumpy (Layer et al., 2014) structural 
variants (SV) to a standardized genome size of 2.67 Gbp. FRC 
features (Vezzi et al., 2012) can assess assembly errors, including 
LOW COV PE: Low read coverage; HIGH COV PE: High read 
coverage; LOW NORM COV PE: Low coverage of normal paired- 
end reads; HIGH NORM COV PE: High coverage of normal paired- 
end reads; COMPR PE: Areas with low CE statistics; STRECH PE: 
Areas with high CE statistics; HIGH. SINGLE PE: Regions with high 
numbers of unmapped pairs; HIGH SPAN PE: Regions with high 
numbers of discordant pairs that map to different contigs/scaffolds; 
HIGH. OUTIE PE: Regions with high numbers of misoriented or 
distant pairs. With the exception of the QV score, lower counts are 
indicative of better assembly. 


highest proportion in TS 2.0 (18.5496 of genome size; Table 
8), similar to that of L1 in the human genome (Beck et al., 
2010). The tree shrew specific tRNA-derived Tu-lll family, the 
largest proportion of the SINE (short interspersed nuclear 
elements) in the Chinese tree shrew genome (Fan et al., 
2013), accounted for 15.1796 of genome size in TS 2.0 (Table 
8). The reason for the unusually high prevalence of the tRNA- 
derived Tu-lll family in the tree shrew genome remains to be 
determined. Because of the improvement in genome quality, 
we were able to identify 127 long transposable elements 
(each >20 kb). We also defined 4 411 709 satellites (total 
length of 131 Mb) in TS 2.0. Among them, 4 293 990 were 
short tandem repeats (each «150 bp) and 1 152 were long 
tandem repeats (each >5kb). The longest tandem repeat was 
mapped to a non-coding region in the end of pseudo- 
chromosome 26 and had a length of 168.3 kb (period size= 
359 bp, copy number=471). We note that some long tandem 
repeats within the genic regions were located in gaps in the 
TS 1.0 assembly (Fan et al., 2013). For instance, a long 


Table 6 Gap closure statistics of the two genome assemblies 


Parameter 
Total number of gaps 1 697 
Partially closed gap using TS_1.0 476 


Partially closed gap using TS_2.0 = 
Fully closed gap using TS_1.0 39 
Fully closed gap using TS_2.0 = 
Fully closed gap in genic region 0 


Trans-scaffold gaps 264 


Long-read assembly TS_2.0) 


Short-read assembly (TS 1.0) 
223 607 


0 

163 220 
65 222 
4112 


Partially closed gap: Gap in one assembly was filled by a scaffold of another assembly, but still had some ambiguous (N) bases within the filled 
region. Fully closed gap: Gap in one assembly was filled by a contig of another assembly, without any ambiguous (N) bases. Trans-scaffold gap: 
Flanking sequences of a gap were aligned to two separate scaffolds or pseudo-chromosomes, which was most likely to be assembly errors. —: Not 


available. 


tandem repeat (period size- 1917 bp, copy number-26) 
overlapped with the coding sequence of the OS9 gene 
(osteosarcoma amplified 9, endoplasmic reticulum lectin), 
which plays a key role in the endoplasmic reticulum stress 
response associated with hypoxia (Satoh et al, 2010). 
Therefore, these long tandem repeats in the tree shrew 
genome may be functional. However, focused studies are 
required for their characterization. 


Table 7 Comparison of transposable elements in Chinese tree 
shrews between short-read assembly (KIZ version 1: TS 1.0) and 
long-read assembly (KIZ version 2: TS 2.0) 


Long-read assembly Short-read assembly 





Type (TS 2.0) (TS 1.0) 
Length (Mp) % in genome Length (Mp) % in genome 
DNA 96.8 3.6 76.6 2.7 
LINE 553.3 20.8 295.2 10.3 
SINE 663.1 24.9 527.2 18.8 
LTR 138.0 5.2 113.1 4 
Other 0.0005 0.0 0.06 0.002 
Unknown 68.0 2.6 0.9 0.03 
Total 1 310.5 49.1 1 001.9 35 


DNA: Deoxyribonucleic acid transposon. LINE: Long interspersed 
nuclear element. SINE: Short interspersed nuclear element. LTR: Long 
terminal repeat. 


Gene annotation updates 

We combined the  homology-based, de novo, and 
transcriptome-based methods (Haas et al., 2008) to predict 
protein-coding genes in the TS 2.0 assembly and identified a 
total of 23 568 non-redundant protein-coding genes (Fan et 
al., 2013) (Table 9; Figure 1B). Among these genes, the 
majority (22 907 genes) were supported by the reported RNA- 
seq data in our recent studies (Fan et al., 2013, 2018). The 
newly updated gene set had longer coding sequences, which 
were, on average, composed of more exons (Table 9) 
compared with the TS 1.0 gene set (Fan et al., 2013). The 


gaps in 2 091 exons in TS 1.0 (Fan et al., 2013) were all filled 
in TS 2.0, thus providing better annotation information for the 
genes. For instance, L/LRB3 (leukocyte immunoglobulin like 
receptor B3), which binds to the major histocompatibility 
complex (MHC) class | molecules on antigen-presenting cells 
to inhibit stimulation of immune response (Huang et al., 2010), 
was complete in TS 2.0, but less than 5096 of this gene 
sequence was retrieved in TS 1.0 (Fan et al., 2013). BMP8A 
(bone morphogenetic protein 8a), which plays a role in the 
development of the reproductive system (Wu et al., 2017), 
exhibited low protein sequence identity (57.2296) with human 
homolog in TS 1.0 (Fan et al., 2013) due to assembly error 
and gaps, but the sequence identity reached 88.94% in TS 
2.0. In addition, ALOX15 (arachidonate 15-lipoxygenase), 
which uses polyunsaturated fatty acid substrates to generate 
various bioactive lipid mediators, such as eicosanoids, 
hepoxilins, and lipoxins (Kuhn et al., 2018; Singh & Rao, 
2019), had only one copy in TS 1.0 (Fan et al., 2013) but four 
copies in TS 2.0. The updated versions of these important 
genes have provided a good basis for further specific 
functional characterization. 

Of the annotated genes, 20 811 (88.396) were functionally 
classified according to InterPro (Mulder & Apweiler, 2007), GO 
(Ashburner et al., 2000), KEGG (Kanehisa & Goto, 2000), 
Swissprot, and TrEMBL (O'Donovan et al., 2002). In addition, 
586 genes were newly annotated in TS 2.0 (Table 10). All 
these genes can be retrieved from TreeshrewDB v2.0. 

It should be mentioned that the MHC region (starting from 
MOG to COL11A2 (Beck et al., 1999) in pseudo-chromosome 
3) was completely assembled (Figure 2A) in the TS 2.0 
assembly. It was previously difficult to assemble this region 
using short-read sequencing technologies as it is highly 
polymorphic and repetitive. There were 412 gaps in the MHC 
region in TS. 1.0 (Fan et al., 2013), which were all filled in TS . 
2.0. Thus, the tree shrew has more MHC class | genes (n=8 
according to TS 2.0) than those identified in humans (n=6), 
although fewer than those identified in mice (n=12) (Elmer & 
McAllister, 2012) (Figure 2B). 
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Table 8 Comparison of transposable element subtypes in Chinese tree shrews between short-read assembly (KIZ version 1: TS_1.0) and 


long-read assembly (KIZ version 2: TS_2.0) 


Long-read assembly 


Short-read assembly 








(TS_2.0) (TS_1.0) 
TE subtype 

Length % in Length % in 

(Mp) genome (Mp) genome 
DNA/En-Spm 7.92 0.30 4.87 0.17 
DNA/hAT 34.29 1.28 33.77 1.18 
DNA/TcMar 52.11 1.96 26.90 0.94 
LINE/CR1 5.57 0.21 2.00 0.07 
LINE/L1 494.51 18.54 267.29 9.34 
LINE/L2 49.48 1.86 22.04 0.77 
LINE/Penelope 2.02 0.08 2.29 0.08 
LTR/ERV1 37.66 1.41 31.77 1.11 
LTR/ERVK 18.55 0.70 8.87 0.31 
LTR/ERVL 78.13 2.93 68.40 2.39 
LTR/Gypsy 2.59 0.10 2.86 0.1 
SINE/Alu 10.60 0.40 3.15 0.11 
SINE/B4 3.02 0.11 1.72 0.06 
SINE/MIR 47.40 1.78 23.75 0.83 
SINE/tRNA-Lys 15.04 0.56 1.14 0.04 
SINE/Tu-III 404.49 15.17 410.09 14.33 


Table 9 Comparison of Chinese tree shrew gene annotation 
between short-read assembly (KIZ version 1: TS 1.0) and long- 
read assembly (KIZ version 2: TS 2.0) 


Long-read assem- Short-read assem- 
Parameter 





bly (TS 2.0) bly (TS 1.0) 

Total number of genes 23 568 22 121 
Complete ORFs 21 117 21 085 
Annotated genes 20 811 20 225 
Average mRNA length 40 114 33 712 
Average CDS length 1527 1 404 
Average exon number 8.86 7.54 
Average exon length 172 186 
Average intron length 4 907 4 937 


ORF: open reading frame. CDS: coding-region sequences. 


Genomic structural variants 

The genome TS 2.0 assembly improved sequence continuity 
and provided an opportunity to explore species-specific 
genomic SVs in genic regions. We used the human (hg38; 
https://www. ncbi. nim. nih. gov/grc/human), macaque (rheMac3 
(Yan et al., 2011)), and mouse (GRCm38;https://www.ncbi.nlm. 
nih. gov/grc/mouse) genomes and TS 2.0 to construct a 
synteny map of orthologous genes, using the human genome 


514  www.zoores.ac.cn 


Table 10 Comparison of Chinese tree shrew gene functional 
annotation between short-read assembly (KIZ version 1: TS 1.0) 
and long-read assembly (KIZ version 2: TS 2.0) 


Short-read assembly — Long-read assembly 











Functional (TS. 1.0) (TS 2.0) 
annotation 

No. Percent (%) No. Percent (%) 
InterPro 17420 78.7 17534 74.4 
KEGG 16593 75.0 16964 72.0 
Swissprot & TrEMBL 20225 91.4 20811 = 88.3 
Unannotated 1 896 8.6 2 309 11.7 
Total 22121 - 23568 - 


InterPro (http://www. ebi.ac.uk/interpro/). KEGG (https://www.kegg.jp/). 
Swissprot & TrEMBL (https://web. expasy. org/docs/swiss- 
prot guideline.html). —: Not available. 


as a reference. We identified 221 SVs in tree shrews 
(Supplementary Table S2) 188 SVs in macaques 
(Supplementary Table S3) and 387 SVs in mice 
(Supplementary Table S4), suggesting that the tree shrew's 
genomic structure had an overall higher similarity to that of 
primates than to that of mice. A detailed comparison of the 
SVs showed that the tree shrews had a seemingly mosaic 
pattern with some similarities to rodents and others to 
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Figure 2 Chinese tree shrew and human MHC genes 





A: Synteny of MHC genes between Chinese tree shrews and humans. HLA class | & Il genes are in red, other genes are in black. Tree shrew TS _ 
2.0 assembly and human genome (hg38; https://www.ncbi.nlm.nih.gov/grc/human) were used for comparison. B: Phylogenetic relationship of MHC- 


class | genes in humans, tree shrews, and mice. 


primates. For instance, the tree shrew and primates (human 
and macaque) had a specific genomic SV in the region 
starting from MYSM!1 to SLC35D1, which was inverted in the 
mouse genome (Figure 3A). Some SVs existed in the tree 
shrew and mouse, but primates had different counterparts, 
such as the region from PRKAB2 to POLR3GL (Figure 3B). 
Note that in this region, GPR89B (G protein-coupled receptor 


Genomic sequence variations at population level 
To understand genomic sequence variations in the Chinese 
tree shrew, we analyzed the whole-genome sequencing data 
of six individuals (each with a sequencing depth of 30x). After 
mapping to TS 2.0, we identified a total of 12.8 million (M) 
SNPs in these individuals (Figure 1B), with 293 128 (including 
194 751 synonymous and 98 377 non-synonymous SNPs) 
located in the coding regions. 

We estimated population genetic parameters for the 


89B) and NOTCH2NL (notch 2 N-terminal like A) were only 
present in the human genome (Figure 3B). The updated TS _ 
2.0 assembly has thus provided more opportunities to 
understand the evolution of SVs and potentially disrupted 
genes in the tree shrew genome. The exact reason for the 
occurrence of species-specific SVs and their potential 
evolutionary and functional effects await further study. 


Chinese tree shrew using the six captive individuals. We 
calculated the nucleotide diversity based on SNPs located in 
coding regions and identified 30 genes with high nucleotide 
diversity based on a cut-off m value of 0.025 (Figure 1C). 
Among these genes, five were located in the MHC loci or 
belonged to the immunoglobulin gene family, suggesting that 
immune genes may have a relatively high evolutionary rate in 
tree shrews, although this needs to be validated by analyzing 
more samples and including non-coding regions (Figure 1C). 
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Figure 3 Examples of structural variants in mouse, macaque, tree shrew, and human genomes 
A: Chinese tree shrews and humans, but not mice, shared a specific genomic structure in the region from MYSM!1 to SLC35D1. B: Chinese tree 


shrews and mice shared a specific genomic structure in the region from PRKAB2 to POLR3GL, which has undergone dramatic changes in humans. 


GPR89B (G protein-coupled receptor 89B) and NOTCH2NL (notch 2 N-terminal like A) genes, marked in green, were only present in the human 


genome. These genomes were retrieved from public domains (mouse: GRCm38; https://www.ncbi.nlm.nih.gov/grc/mouse; macaque: rheMac3(Yan 


et al., 2011); human: hg38; https://www.ncbi.nlm.nih.gov/grc/human) or generated in this study (tree shrew: TS 2.0). 
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Whether or not this pattern reflects a compensatory effect due 
to the loss of RIG-I in the tree shrew genome (Fan et al., 
2013; Xu et al., 2016; Yao, 2017) remains to be studied. We 
calculated Tajima's D (Tajima, 1989) for each gene, and found 
discrete distribution, with no obvious clustering 
(Supplementary Figure S1). Results for Fu and Li's D test 
(Fu & Li, 1993), Fu and Li's F test (Fu & Li, 1993), Fay and 
Wu's H test (Fay & Wu, 2000) all showed a pattern similar to 
the Tajima's D test. Nonetheless, these results should be 
treated with caution, as they may be biased by the limited 
sample size. 


Tree shrew database updates 

Based on the TS 2.0 assembly, we updated TreeshrewDB 
v2.0 (Figure 4) to distribute the new high-quality tree shrew 
genome and our newly annotated gene and genome 
information. The main database updates included revision and 
expansion of genomic data, gene co-expression patterns, 
population genetic statistics, and improvements to the web 
interface. Briefly, for the retrieval module, we updated the 
reference sequence ID, genomic location and map, transcript 
sequence, and functional annotation based on the new gene 
set. We added five primate species (gibbon (Nomascus 
leucogenys), golden snub-nosed monkey (Rhinopithecus 
roxellana), black snub-nosed monkey (R. bieti), Bolivian 
squirrel monkey (Saimiri boliviensis boliviensis), and 
bushbaby (Otolemur garnettii)) in the orthologous gene sets 
from Ensembl (release 71;https://asia.ensembl.org/index.html) 
to allow for better comparison for one-to-one homologs. The 
mRNA expression pattern was upgraded based on RNA-seq 
data from 26 tissues and/or cells (Supplementary Table S1). 
For each gene query, it is possible to retrieve basic 
information on the queried gene, sequence alignment with 
homologs of other species, MRNA expression levels in tissues/ 
cells, co-expression patterns in brain tissues, sequence 
variations at the population level, and results of population 
genetic parameters (including =, Watterson theta estimate 
(8w) (Watterson, 1975), Tajima's D (Tajima, 1989), Fu and Li's 
D (Fu & Li, 1993), Fu and Li's F (Fu & Li, 1993), and Fay and 
Wu's H (Fay & Wu, 2000)) (Figure 4). 

We added the new Chinese tree shrew reference genome 
(TS_2.0) to the TreeshrewDB v2.0, which is free to download. 
The updated gene sequences can be extracted in batches or 
individually by our homemade ExtractSeq. We incorporated 
Blast (Altschul et al., 1997) and Genewise (Birney et al., 2004) 
to show the mapping of genes in the genome. Overall, the 
updated database now provides a comprehensive annotation 
of the Chinese tree shrew genome to satisfy the needs of 
evolutionary analysis and biomedical research. 


DISCUSSION 


The combination of long-read sequencing and long-range 
chromosome interaction mapping (such as Hi-C) represents 
the most efficient approach to produce high-quality reference 
genome assembly (Bickhart et al., 2017; Kronenberg et al., 
2018). In this study, we used these techniques to generate an 


updated reference genome for the Chinese tree shrew (KIZ 
version 2: TS_2.0) and resolved some of the problems from 
our earlier tree shrew genome (Fan et al., 2013). The updated 
TS_2.0 assembly enabled accurate identification of large and 
complex repeat regions, gene structures, and species-specific 
genomic SVs in the genic regions. This high-quality tree shrew 
genome will facilitate the use of this species in both 
biomedical and basic research, such as annotation and 
interpretation of RNA-seq data from normal and pathological 
tissues (Supplementary Table S1), and for a more 
comprehensive understanding of the evolution of primate- 
specific SVs and their potential regulatory changes 
(Kronenberg et al., 2018). For instance, we identified 221 SVs 
in the genic regions of the Chinese tree shrew genome and 
found that the overall pattern of SVs in the tree shrew more 
resembled that of primates than that of rodents 
(Supplementary Tables S2-—4), further confirming the very 
close relationship between tree shrews and primates (Fan et 
al., 2013; Yao, 2017). It should be mentioned that the TS_2.0 
assembly still misses many large and complex SVs due to the 
limitations of current sequencing technology and assembly 
approaches. Moreover, we did not experimentally validate the 
SVs between TS 1.0 and TS 2.0, which would offer further 
information regarding the construction of a well-defined 
reference genome of the Chinese tree shrew. We will continue 
to refine the tree shrew genome using more data in the future. 
In general, the new TS 2.0 assembly filled most of the gaps 
and corrected most assembly errors present in the previous 
tree shrew genome (Fan et al., 2013), thereby providing better 
gene annotations. To understand the unique genetic features 
of the tree shrew genome, such as long tandem repeats, 
repeat content, and genomic SVs, detailed studies should be 
carried out in the future. 

In our previous study, we built the TreeshrewDB (Fan et al., 
2014) for easy access to the Chinese tree shrew genome data 
based on short-read sequencing technology (Fan et al., 
2013), which has been visited frequently and used by many 
researchers. We comprehensively updated TreeshrewDB v2 
based on the new high-quality reference genome (TS 2.0) 
generated in this study. We optimized the visualizations of 
gene annotation and genomic variations of the tree shrew and 
included results from population genetic parameters for this 
species. Furthermore, the inclusion of the reported 
transcriptomic data from 26 tissues and cells (Supplementary 
Table S1) has enhanced our knowledge of mRNA expression 
profiling in the Chinese tree shrew. This database will be 
regularly updated to include recently released genetic data 
and serve as a platform for data sharing among tree shrew 
studies and for further elucidation of the genetic features of 
this animal. We believe that the tree shrew genome 
assembly TS_2.0 and the updates will meet the increasing 
needs in the field. 


SUPPLEMENTARY DATA 


Supplementary data to this article can be found online. 
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Figure 4 Overview of updated tree shrew database (TreeshrewDB version 2.0) 

Inclusion of the high-quality reference genome assembly (TS_2.0) in TreeshrewDB version 2.0 provided a comprehensive update of gene 
annotation information, genomic variations, population genetic features, and mRNA expressions. Population genetic parameters (including rr, 
Watterson theta estimate (Ow) (Watterson, 1975), Tajima' s D (Tajima, 1989), Fu and Li's D (Fu & Li, 1993), Fu and Li's F (Fu & Li, 1993), and Fay 
and Wu’s H (Fay & Wu, 2000)) were estimated based on SNPs located in coding regions in the whole genome sequences of six wild tree shrews. 


The database is freely accessible at http://www.treeshrewdb.org. 
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ABSTRACT 


Microbial translocation is a cause of systemic 
immune activation in HIV/SIV infection. In the 
present study, we found a lower CD8* T cell 
activation level in Macaca leonina (northern pig- 
tailed macaques, NPMs) than in Macaca mulatta 
(Chinese rhesus macaques, ChRMs) during 
SIVmac239 infection. Furthermore, the levels of 
plasma LPS-binding protein and soluble CD14 in 
NPMs were lower than those in ChRMs. Compared 
with ChRMs, SIV-infected NPMs had lower Chiu 
Scores, representing relatively normal intestinal 
mucosa. In addition, no obvious damage to the ileum 
or colon epithelial barrier was observed in either 
infected or uninfected NPMs, which differed to that 
found in ChRMs. Furthermore, no significant 
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microbial translocation (Escherichia coli) was 
detected in the colon or ileum of infected or 
uninfected NPMs, which again differed to that 
observed in ChRMs. In conclusion, NPMs retained 
superior intestinal integrity and limited microbial 
translocation during SIV infection, which may 
contribute to their lower immune activation 
compared with ChRMs. 
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INTRODUCTION 


Immune activation is a hallmark of HIV/SIV infection and is 
closely correlated with disease progression (Boasso & 
Shearer, 2008; Hazenberg et al., 2003). In addition, T cell 
activation is more predictive of AIDS disease progression than 
plasma viral load and CD4* T cell count (Giorgi et al., 1999; 
Liu et al., 1998). Immune activation may accelerate the 
depletion or impede the reconstitution of CD4* T cells by 
exacerbating lymphoid tissue fibrosis (Estes et al., 2008; Hunt 
et al., 2003; Zeng et al., 2011). Furthermore, a number of non- 
AIDS diseases are also correlated with immune activation, 
such as cardiovascular disease and cancer (Cadogan & 
Dalgleish, 2008; Kaplan et al., 2011; Klatt et al., 2013). 
However, immune activation is difficult to alleviate, even in 
patients with sustained viral suppression during antiretroviral 
therapy (French et al., 2009). Therefore, therapeutic strategies 
to relieve persistent immune activation are needed for patients 
to improve their quality of life. 

Microbial translocation is a significant factor that contributes 
to persistent immune activation (Brenchley et al., 2006; 
Sandler & Douek, 2012). HIV/SIV infection can cause severe 
mucosal CD4* T cell depletion and damage to intestinal 
integrity, followed by translocation of microbial products into 
circulation, including 16S rDNA and lipopolysaccharide (LPS) 
(Brenchley et al., 2004; Dillon et al., 2012; Marchetti et al., 
2013; Veazey et al., 1998). As a result, soluble CD14 (sCD14) 
released by activated monocytes and LPS-binding protein 
(LBP) levels increase (Brenchley et al., 2006). Plasma sCD14 
levels can predict mortality in HIV infection (Sandler et al., 
2011). In addition, elevated microbial translocation is 
associated with increased risk of AIDS-related lymphoma and 
HIV-associated dementia (Ancuta et al., 2008; Marks et al., 
2013). Therefore, therapeutic strategies targeting microbial 
products and their downstream effects will be of great benefit. 

Pig-tailed macaques are divided into three species based 
on morphological characteristics (Spartaco, 2001). Thereinto, 
Macaca nemestrina (southern pig-tailed macaques, SPMs) 
are suitable animal models to study the relationship between 
microbial translocation and disease progression of AIDS due 
to their compromised intestinal integrity and increased 
microbial translocation prior to SIV infection (Klatt et al., 
2010). Our recent studies indicate, however, that pre-infection 
peripheral CD4* T cell counts in Macaca leonina (northern pig- 
tailed macaques, NPMs) differ from those in Macaca 
nemestrina (Klatt et al., 2012; Zheng et al., 2014b). In 
addition, CD4* T cells also exhibit superior homeostasis in 
SIVmac239-infected NPMs compared with Chinese rhesus 
macaques (ChRMs), which is not reported in SIV-infected 
SPMs (Zhang et al., 2017a). This raises the question as to 
whether compromised intestinal integrity and increased 
microbial translocation occur in pre- and post-infected NPMs 
similarly. 

In the present study, we analyzed the levels of CD8* T cell 
activation and plasma biomarkers of microbial translocation in 
SIVmac239-infected NPMs and ChRMs and normal NPMs. To 
further analyze the status of microbial translocation, intestinal 


integrity and microbial translocation of NPMs were also 
studied. 


MATERIALS AND METHODS 


Animals and sample collection 
Viral seroprevalence in NPMs and ChRMs enrolled in this 
study was investigated as described previously (Zhang et al., 
2016b). All NPMs and ChRMs were healthy males and were 
seronegative for SIV, STLV, SRV, and SFV prior to the study. 
Six NPMs and three ChRMs (6-8 years old) were inoculated 
with a 3 000 50% tissue culture infectious dose (TCID4)) of 
SlVmac239 intravenously. Blood was collected periodically, 
and tissues were obtained from the macaques at necropsy. 
Tissue and blood samples from four uninfected NPMs were 
also obtained as the control. In addition, blood from five 
uninfected ChRMs and tissues from one SIVmac239-infected 
ChRM were also obtained. Blood was partially used for flow 
cytometry analyses and isolation of plasma and peripheral 
blood mononuclear cells (PBMCs). All tissues were partially 
preserved in 496 paraformaldehyde and -80 ? C immediately 
after collection. Detailed information on animals used in 
histopathological assessment is shown in Table 1. 

All animal experiments and procedures were approved by 
the Ethics Committee of the Kunming Institute of Zoology, 
Chinese Academy of Sciences (approval No.: SYDW-2015023; 


Table1 Animals enrolled in this study 


Survival days 





Animal ID Species Status Age post infection 
08247 NPM Infected y» 491^ 
08287 NPM Infected 7 641° 
09203 NPM Infected 6? 648° 
09211 NPM Infected 6? 476° 
10205 NPM Infected 58 626° 
10225 NPM Infected 5? 634° 
08091 ChRM Infected 7? 320° 
08303 ChRM Infected 7? 475° 
08439 ChRM Infected 7 628° 
08051 ChRM Infected T? 402^ 
08269 NPM Uninfected ged - 
09223 NPM Uninfected ged - 
09237 NPM Uninfected ged - 
09255 NPM Uninfected ged - 
11049 ChRM Uninfected 6* = 
105353 ChRM Uninfected T° = 
10039 ChRM Uninfected f° = 
09003 ChRM Uninfected 8° = 
090017 ChRM Uninfected 8° - 


NPM: Northern pig-tailed macaque; ChRM: Chinese rhesus macaque. 
a: Age at infection. ^: Natural death. °: Euthanasia. ^: Age at necropsy. 
*: Blood collection only. - : Not available. 
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approval date: 20 June 2015). 


Flow cytometry 

Multiparameter flow cytometry was performed using peripheral 
blood, as described previously (Ma et al., 2012; Xia et al., 
2009; Zhang et al., 2017b; Zheng et al., 2014a). For the 
frequency of HLA-DR*CD8* T cells, blood was treated with 
lysing buffer (BD Biosciences, CA, USA) for 10 min and then 
incubated with a mixture of flow cytometry antibodies at 4 °C 
for 30 min. For staining with ki67, surface-labeled cells were 
further treated with fixation and permeabilization solution (BD 
Biosciences, CA, USA), followed by perm/wash buffer (BD 
Biosciences, CA, USA). Cross-reactive flow cytometry human 
antibodies anti-CD3 APC-Cy7 (clone SP34-2), anti-CD8 PE- 
Cy? (clone RPA-T8), anti-HLA-DR APC (clone G46-6), and 
anti-ki67 PE (B56) were purchased from BD Pharmingen 
(Franklin Lakes, NJ, USA). Anti-CD4 PerCP-Cy5.5 (clone 
OKT4) was obtained from Biolegend (San Diego, CA, USA). 
Flow cytometry acquisition was performed on a BD 
FACSVerse'" flow cytometer (BD, Franklin Lakes, NJ, USA) 
and flow cytometric data analysis was performed using 
FlowJo vX.0.7 (TreeStar, Ashland, OR, USA). 


Enzyme-linked immuno sorbent assay (ELISA) 

Plasma used for ELISA was isolated from blood by 
centrifugation (room temperature, 500 g, 10 min). Plasma was 
preserved at -80 °C until detection. Plasma sCD14, LBP and I- 
FABP levels were detected using commercially available 
ELISA kits obtained from R&D Systems (Minneapolis, MN, 
USA). 


Histopathological assessment 

Intestinal tissues fixed in 4% paraformaldehyde were 
dehydrated in a graded ethanol series and then embedded in 
paraffin. We collected 4 um thick paraffin sections, which were 
stained with hematoxylin and eosin (H&E). Intestinal injury 
was evaluated using the Chiu score system following the 
grading principles described by Chiu et al. (1970). 


Immunofluorescence 

Immunofluorescence was performed as described previously 
(Zhang et al, 2016a) The 4 um thick paraffin sections 
mentioned above were baked at 60 ? C for 2 h before 
dewaxing in xylene and rehydrating through a graded ethanol 
series. The sections were then washed in flowing water for 
more than 15 min. Antigen retrieval was performed using the 
high-pressure method. The sections were then naturally 
cooled to room temperature and washed in flowing water for 5 
min. For immunofluorescence, sections were treated with 1% 
sodium borohydride solution for 30 min and washed in 
phosphate buffered saline. After treatment with blocking 
buffer, sections were incubated with primary and secondary 
antibodies. The sections were then treated with Sudan Black 
for 20 min and washed with 50% ethanol. Mounting medium 
containing DAPI was used to adhere a coverglass to the 
microscope slide at last. Cross-reactive antibodies, including 
rabbit anti-claudin-3 antibody (polyclonal), mouse anti- 
cytokeratin antibody (clone MNF116), donkey anti-rabbit IgG 
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(Alexa Fluor®488, polyclonal), and donkey anti-mouse IgG 
(Alexa Fluor®555, polyclonal), were purchased from Abcam 
(Cambridgeshire, UK). Rabbit anti-E. coli antibody was 
purchased from Dako (Glostrup, Denmark). Microscopic 
examinations were performed on a Leica DMI4000B 
Microsystem (Leica Microsystems, Wetzlar, Germany). All 
slides were inspected, with photographs taken using the 
system. 


Statistical analysis 

Two-way ANOVA was used to compare frequencies of HLA- 
DR:CD8: T cells, ki67*CD8* T cells and plasma I-FABP levels 
between NPMs and ChRMs. The Mann-Whitney nonparametric 
test was used to compare pre-infection plasma sCD14, LBP 
and I-FABP levels and Chiu scores. Spearman's rank test 
was used to determine correlations between CD8* T cell 
activation and plasma sCD14, LBP and I-FABP levels in 
NPMs and ChRMs during SlVmac239 infection. All data 
analyses were performed using GraphPad Prism v6.01 
(GraphPad Software, San Diego, CA, USA). 


RESULTS 


NPMs have lower frequencies of activated and 
proliferating CD8* T cells than ChRMs during SIVmac239 
infection 

T cell activation, especially that of CD8* T cells, is more 
closely correlated with disease progression than plasma viral 
load or CD4* T cell count during HIV-1 infection (Giorgi et al., 
1999; Liu et al., 1998). In our previous study, we also found a 
close relationship between T cell activation and disease 
progression in SlVmac239-infected NPMs (Zhang et al., 
2017b). Therefore, here we compared the frequencies of 
activated and proliferating CD8* T cells in NPMs and ChRMs. 
The infection was divided into an acute and chronic phase at 
12 weeks post infection (Figure 1). The frequencies of HLA- 
DR*CD8* T cells and ki67*CD8* T cells in NPMs were both 
lower than those in ChRMs, regardless of acute or chronic 
phase, which indicated that CD8* T cell activation and 
proliferation levels of NPMs were lower than those of ChRMs 
during SIVmac239 infection (Figure 1A, B). 


NPMs have lower levels of plasma sCD14 and LBP than 
ChRMs during SlVmac239 infection 

As microbial translocation is an important cause of immune 
activation, we further analyzed the plasma biomarkers of 
microbial translocation, including sCD14 and LBP (Figure 2). 
We found no significant differences in pre-infection plasma 
SCD14 and LBP levels between NPMs and ChRMs (Figure 
2A). These results implied that there was no obvious 
difference in microbial translocation between NPMs and 
ChRMs in the absence of SIV infection. Plasma sCD14 levels 
in NPMs were lower than in those ChRMs during SIVmac239 
infection, which indicated that SlVmac239 infection might 
cause more severe microbial translocation in ChRMs than in 
NPMs (Figure 2B). However, there was no obvious difference 
in LBP levels between SlVmac239-infected NPMs and 
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Figure 1 Comparisons of CD8* T cell activation and proliferation between NPMs and ChRMs 

Frequencies of CD8*HLA-DR* (A) and CD8*ki67* (B) in peripheral blood T cells. Cubic spline curves were created to show the data trends. Infection 
was divided into acute phase and chronic phase at 12 weeks post infection. Two-way ANOVA was used to compare the frequencies of CD8*HLA- 
DR: and CD8*ki67* T cells between NPMs (black, n=6) and ChRMs (red, n=3). P-values represent differences between the two species. P«0.05 


was considered statistically significant. 


ChRMs (Figure 2C). Furthermore, plasma sCD14 levels were 
directly correlated with frequencies of HLA-DR*CD8* T cells in 
both NPMs and ChRMs, which implied that higher sCD14 
levels might contribute to higher CD8* T activation (Figure 
2D) |n contrast, plasma LBP levels were not directly 
correlated with the frequencies of HLA-DR*CD8* T cells in 
either NPMs or ChRMs (Figure 2E). Thus, not all microbial 
translocation-related products appear to be closely associated 
with immune activation. 


Intestinal mucosal injuries in ChRMs were more severe 
than in NPMs post SlVmac239 infection 
Microbial translocation mainly occurs in the intestine; thus, we 








focused on mucosal injuries in the colon and ileum. The 
normal NPMs had relatively intact intestinal mucosa, whereas 
the SlVmac239-infected ChRMs and NPMs presented varying 
degrees of damage (Figure 3A). We evaluated the severity of 
mucosal injury using the Chiu score system (Chiu et al., 
1970), which indicated that colon mucosal injuries in 
SIlVmac239-infected ChRMs were more severe than those in 
SlVmac239-infected NPMs, though there was no obvious 
difference in the Chiu score of the ileum mucosa (Figure 3B, 
C). In addition, colon and ileum mucosal injuries in 
SlVmac239-infected NPMs were more severe than those in 
the normal group. These results indicate that SIVmac239 
infection can damage intestinal mucosa and NPMs may be 
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Figure 2 Plasma microbial translocation biomarker levels and their correlations with frequencies of CD8*HLA-DR* T cells 

A: Pre-infection plasma sCD14 and LBP levels of NPMs (n=10) and ChRMs (n=8). Mann-Whitney nonparametric test was used to calculate P- 
values. P«0.05 was considered statistically significant. Two-way ANOVA was used compared sCD14 (B) and LBP (C) plasma levels during 
SIVmac239 infection. Plasma sCD14 and LBP levels in NPMs (n=6) and ChRMs (n= 3) from the same post-infection time points were compared. P< 
0.05 was considered statistically significant. Correlations between frequencies of CD8*HLA-DR* T cells and sCD14 (D) and LBP (E) plasma levels in 
NPMs (n=6) and ChRMs (n=3) were analyzed. Spearman rank correlation coefficients were calculated to assess correlations. P<0.05 was 


considered statistically significant. 
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less affected than ChRMs. 


NPMs maintained superior intestinal epithelial integrity 
post SlVmac239 infection 

Tight junctions between the intestinal epithelium play an 
important role in preventing translocation of gut microbe 
products from the lumen into the subepithelial region via the 
paracellular pathway (Milatz et al., 2010). Therefore, we 
detected the expression of tight junction protein claudin-3, 
which reflects the integrity of the intestinal epithelium. The 
intestinal epithelium in SIVmac239-infected ChRMs showed 
severe discontinuities in both the colon and ileum. In contrast, 
however, the intestinal epithelium of SlVmac239-infected 
NPMs showed superior continuities, as did normal NPMs 
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Figure 3 Intestinal mucosal injuries in the colon and ileum 


C, 


(Figure 4A). These results indicate that SIVmac239 infection 
may not severely affect the expression of tight junction protein 
claudin-3 in NPMs. 


Limited microbial translocation levels in SlVmac239- 
infected NPMs 

Although  SlVmac239-infected NPMs maintained superior 
intestinal epithelial integrity, whether the relatively intact 
intestinal epithelium was concomitant with lower microbial 
translocation was unclear. Therefore, we selected Escherichia 
coli as a representative microbial product and analyzed its 
infiltration into the intestinal epithelium (represented by 
cytokeratin). As expected, translocation of E. coli through the 
intestinal epithelium was obvious in SlVmac239-infected 
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A: Representative images (100x) of colon and ileum in normal NPMs (NPM Nor, n=4) and SIVmac239-infected ChRMs (ChRM SIV, n=4) and NPMs 
(NPM SIV, n=6). B: Colon Chiu scores. C: lleum Chiu scores. Chiu scores range from 0 to 5, with a higher score indicating more severe injury. Mann- 
Whitney nonparametric test was used to compare Chiu scores between different groups. P«0.05 was considered statistically significant. 
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Figure 4 Tight junctions and translocation of microbial products (E. coli) into intestinal lamina propria in the colon and ileum 

A: Representative images (100x) of epithelial barriers in the colon and ileum of normal NPMs (NPM Nor, n=4) and SIVmac239-infected NPMs (NPM 
SIV, n=6) and ChRMs (ChRM SIV, n=4). Tight junction protein claudin-3 (green) was stained to show the integrity of epithelial barriers. Blue is DAPI. 
White arrows show discontinuities, red arrows show continuities of epithelial barriers. B: Representative images (100x) of microbial translocation in 
the colon and ileum. Immunofluorescence was performed to stain DAPI (blue), E. coli (green), and cytokeratin (red). White arrows show 


translocation of microbial products (E. coli) into lamina propria. 


ChRMs, but rare in SIVmac239-infected or normal NPMs 
(Figure 4B). These results are consistent with the expression 
of the tight junction protein claudin-3. In summary, NPMs had 
limited microbial translocation due to superior intestinal 
epithelial integrity. 


DISCUSSION 


Microbial translocation is associated with systemic immune 
activation during HIV/SIV infection, which further accelerates 
disease progression (Brenchley et al., 2006; Canary et al., 
2013; Marchetti et al., 2011). 

The pig-tailed macaque is an Old-World monkey known to 
be susceptible to HIV-1 infection (Agy et al., 1992; Liao et al., 
2007). We previously identified a novel TRIM5-cyclophilin 
fusion gene (TRIMCyp) in the pig-tailed macaque that did not 
inhibit HIV-1 or SIV replication, thus explaining why the pig- 
tailed macaque is prone to HIV-1 infection (Kuang et al., 2009; 
Liao et al., 2007). Recently, three NPM virus models, i.e., HIV- 


Inesa HIV-1544, and stHIV-1sv, were established in our lab and 
a long-term viral reservoir was detected in these NPMs, which 
indicated that NPMs might serve as a potential model for HIV- 
1 reservoir research (Pang et al., 2017, 2018). In addition, we 
previously reported on various biological parameters of NPMs 
(Lian et al., 2016, 2017, 2018; Song et al., 2018; Zhang et al., 
2014, 2016b, 2016c; Zheng et al., 2014b; Zhu et al., 2015), 
thus promoting the application of NPMs in AIDS and other 
biomedical research. SPMs are appropriate research models 
due to their compromised intestinal integrity and increased 
microbial translocation in the absence of SIV infection (Canary 
et al., 2013; Klatt et al., 2010). In the present study, however, 
we found lower levels of CD8* T cell activation in SIVmac239- 
infected NPMs than in ChRMs, which was partly attributed to 
superior intestinal integrity and limited microbial translocation. 
These results imply that the intestinal microenvironment of 
NPMs may differ from that of SPMs. Thus, further study on the 
differences and potential mechanisms may provide new 
insight into the therapeutic strategies of microbial 
translocation during HIV infection. 
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Several plasma biomarkers are commonly selected to 
evaluate the level of microbial translocation, including LPS, 
sCD14, LBP, and EndoCAb (endotoxin-core antibody) 
(Brenchley et al., 2006; Marchetti et al., 2013). In contrast to 
uninfected individuals, HIV-infected progressors have 
significantly higher levels of plasma sCD14 and LBP and 
significantly lower levels of EndoCAb (Brenchley et al., 2006). 
In the present study, we also observed direct correlations 
between plasma sCD14 levels and CD8* T cell activation in 
SIVmac239-infected NPMs and ChRMs, which implied that 
microbial translocation was closely associated with immune 
activation (Figure 2D). However, plasma biomarkers do not 
consistently reflect the real level of microbial translocation and 
limitations do exist (Marchetti et al., 2013). Here, we did not 
observe direct correlations between plasma LBP levels and 
CD8* T cell activation in either NPMs or ChRMs (Figure 2E). 
Previous research has demonstrated that gut commensal 
microbes are not the main contributors to circulating CD4* T 
cell activation (Zimmermann et al., 2015). Furthermore, CD8* 
T cells may be activated in an antigen-independent manner 
that is not associated with the level of microbial translocation- 
related products (Bastidas et al, 2014). Therefore, the 
relationship between certain plasma microbial translocation 
biomarkers and immune activation needs further confirmation. 

Histopathological assessment provides the most direct 
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evidence of intestinal mucosal injuries. In the present study, 
superior intestinal integrity was only coincident with lower 
plasma sCD14 levels in NPMs. These results indicate that 
compromised intestinal integrity may cause severe microbial 
translocation, followed by systemic immune activation, which 
accelerates disease progression. Furthermore, microbial 
translocation resulting from breakdown of intestinal mucosa 
may result in microbial product translocation into draining or 
distant lymph nodes, followed by persistent local immune 
activation (Estes et al., 2010). Therefore, intestinal mucosal 
damage appears to be the fundamental cause of microbial 
translocation and subsequent immune activation. 

To assess the level of intestinal injury, fatty acid-binding 
proteins are often used as plasma biomarkers of tissue injury 
(Pelsers et al., 2003, 2005). In the present study, we selected 
intestinal-type fatty acid-binding protein (I-FABP) as the 
plasma biomarker of intestinal injury, as described elsewhere 
(Jenabian et al., 2015; Pelsers et al., 2003). As the plasma l- 
FABP levels in most individuals were below the lower limit, we 
did not find any significant differences in plasma I-FABP levels 
between NPMs and ChRMs either prior to or post infection 
(Figure 5A, B). Further study showed that plasma I-FABP 
levels were directly associated with CD8* T cell activation in 
NPMs, but not in ChRMs (Figure 5C, D). In summary, 
although the plasma I-FABP level was correlated with immune 
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Figure 5 Plasma I-FABP levels and their corrections with CD8* T cell activation 

A: Pre-infection plasma I-FABP levels. Mann-Whitney nonparametric test was used to calculate P-values. P«0.05 was considered statistically 
significant. B: Plasma I-FABP levels during SlVmac239 infection. Two-way ANOVA compared the plasma I-FABP levels of NPMs and ChRMs. P< 
0.05 was considered statistically significant. C, D: Correlations between CD8* T cell activation and plasma I-FABP levels in NPMs and ChRMs 
during SlVmac239 infection. Spearman’s rank test was used to determine correlations. P«0.05 was considered statistically significant. 
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activation to some extent, I-FABP may not fully reflect the 
level of intestinal injury or microbial translocation. 

There are some limitations in the present study. First, it is 
difficult to obtain absolutely accurate results using such a 
small number of animals. Given the differences between each 
macaque, more experimental animals would improve 
accuracy. In addition, infection time varies between 
individuals, which may also affect pathological consequences. 

In the present study, we characterized the relationship 
between immune activation and microbial translocation in 
SIVmac239-infected NPMs for the first time. Intriguingly, 
NPMs maintained superior intestinal integrity and limited 
microbial translocation after SIV infection, which was different 
from that observed in ChRMs and SPMs. How NPM intestinal 
mucosa avoids damage from SIV infection is still an enigma 
that needs to be resolved. Research on intestinal mucosal 
immunity in NPMs is likely to provide guidance for therapeutic 
strategies of microbial translocation in HIV-1-infected patients. 
In addition, although pig-tailed macaques are susceptible to 
HIV-1 infection, how HIV-1 affects regional immunity in the 
intestinal tract of pig-tailed macaques is still unclear (Kuang et 
al., 2009; Liao et al., 2007; Pang et al., 2017). Studies on this 
problem may provide deeper insight into the mechanisms of 
microbial translocation during HIV-1 infection. 
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ABSTRACT 


Elucidating the closest living relatives of extant 
primates is essential for fully understanding 
important biological processes related to the 
genomic and phenotypic evolution of primates, 
especially of humans. However, the phylogenetic 
placement of these primate relatives remains 
controversial, with three primary hypotheses 
currently espoused based on morphological and 
molecular evidence. In the present study, we used 
two algorithms to analyze differently partitioned 
genomic datasets consisting of 45.4 Mb of 
conserved non-coding elements and 393 kb of 


concatenated coding sequences to test these 
hypotheses. We assessed different genomic 
histories and compared with other molecular 


studies found solid support for colugos being the 
closest living relatives of primates. Our phylogeny 
showed Cercopithecinae to have low levels of 
nucleotide divergence, especially for Papionini, and 
gibbons to have a high rate of divergence. The 
MCMCtree comprehensively updated divergence 
dates of early evolution of Primatomorpha and 
Primates. 
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INTRODUCTION 


The advancement of genomics has ushered an era in which 
genetic studies on the evolution of adaptively complicated 
traits have become possible. This is especially interesting for 
humans, non-human primates, and their closest relatives. 
However, such assessment requires a robust hypothesis of 
phylogenetic relationships. To date, the phylogeny of primates 
and their closest relatives remains a subject of intense 
debate. Analyses of different morphological or molecular 
datasets show conflicting topologies, most likely due to the 
accelerated evolution of the Euarchonta, which includes the 
Primates, Southeast Asian flying lemurs (Dermoptera), and 
tree shrews (Scandentia). Furthermore, apparent incomplete 
lineage sorting (ILS) can drive conflict between estimations of 
species trees and gene trees (Murphy et al., 2007; Nishihara 
et al., 2006; Scornavacca & Galtier, 2017). 

Massive parallel data have driven advancements in 
genomics and associated methods (Liu et al., 2010; Mirarab 
et al., 2014a, 2014b) and can be applied to hypothesize 
species trees. Analyses using a greater number of genes or 
diverse data can be used to parse credible species trees, 
such as that for modern birds (Jarvis et al, 2014). 
Longstanding hypotheses based on morphological and 
molecular proof have suggested a sister-group relationship 
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between primates and tree shrews (Kay et al., 1992; Novacek, 
1992; Wible & Covert, 1987). In addition, several other studies 
have regarded both tree shrews and flying lemurs (colugos) 
together as a sister group of primates (Bloch et al., 2007; Liu 
et al., 2001; Murphy et al., 2001; Nie et al., 2008; Sargis, 
2002; Springer et al., 2003, 2004). Further phylogenetic 
analysis incorporating paleontological evidence also suggests 
that primates and colugos are sister taxa (Beard, 1993). 
Previous molecular studies also support colugos as the 
closest living relatives of primates (Bininda-Emonds et al., 
2007; Hudelot et al., 2003; Waddell et al., 2001). Genomic 
analyses further postulated a third potential topology: 
((primates, colugos), tree shrews) (Janecka et al., 2007; 
Perelman et al., 2011), though this was based on analyses of 
limited genomic changes (insertion and deletions, InDels) and 
few nuclear gene fragments. Furthermore, these analyses did 
not exclude biases due to ILS, data partitioning, or data 
insufficiency (Chojnowski et al., 2008; Patel et al., 2013; 
Rokas et al., 2003). 

Whole genome data with a well-preserved evolutionary 
history for the genome of each species can be used to 
recover an exact species tree (Jarvis et al., 2014; Mitchell et 
al., 2015; Rokas et al., 2003; Wolf et al., 2002). Herein, we 
inferred the closest living relatives of primates and estimated 
the time scale for extant Euarchonta based on 23 existing 
genome datasets and an out-group (Mus musculus). Analyses 
relied on two types of data: 393 kb of concatenated protein- 
coding sequences from 706 one-to-one orthologous genes 
(OOGs) and 45.4 Mb of concatenated conserved non-coding 
elements (CNEs). 


MATERIALS AND METHODS 


Identification of OOGs and sequence alignment/trimming 
analyses 

We downloaded all protein-coding sequences of 20 species 
(primates, colugos, and tree shrews) (Table 1) from the NCBI 
assembly database (https://www. ncbi. nim. nih. gov/). Data for 
Homo sapiens, Pan troglodytes, Tupaia belangeri, and Mus 
musculus were downloaded from Ensembl 88 (http://www. 
ensembl. org/info/data/ftp/). The dataset with the longest 
coding sequence (CDS) of each gene for each species was 
retained and then translated into proteins to detect paired 
OOGs using InParanoid-4.1 (Remm et al., 2001) (parameters: 
$score_cutoff=40, $conf_cutoff=0.05, $group_overlap_cutoff= 
0.5, $seq_overlap_cutoff=0.5 and $segment_coverage_cutoff= 
0.25). Multiple species OOGs were captured by best 
reciprocal intersections. Prank v100802 (Loytynoja & 
Goldman, 2005, 2008, 2010) (parameters: -f=fasta -F —codon — 
noxml — notree — nopost) and Gblocks v0.91b (Castresana, 
2000; Talavera & Castresana, 2007) (parameters: —t=c —b4= 
5 —b5=n) were used to align orthologous regions and to trim 
alignments with low-quality regions for each OOG, 
respectively. 


Pairwise and multiple whole genome alignments 
The pairwise whole genome alignments of Pan troglodytes, 


Gorilla gorilla, Macaca mulatta, Microcebus murinus, 
Galeopterus variegatus, and Mus musculus vs. Homo sapiens 
were downloaded from the University of California Santa Cruz 
(UCSC) pairwise alignments. We also downloaded repeat- 
masked whole genome sequences (17 species) from the 
NCBI assembly database, with the repeat-masked Homo 
sapiens genome obtained from the UCSC Genome Browser 
(http://genome.ucsc.edu/). Pairwise whole genome alignments 
for 17 species (Table 1) vs. Homo sapiens were obtained 
using LASTZ (Schwartz et al., 2003) with the following 
parameters: E=30, O=400, K=3 000, L=2 200, and M=50. A 
24-way whole genome multiple alignment was then generated 
using Multiz v11.2 (Blanchette et al., 2004) with default 
parameters, and with Homo sapiens regarded as the 
reference taxon. 


Detection of CNEs and pre-processing 

Phastcons (Siepel et al., 2005) was utilized to obtain elements 
with conserved scores under a given multiple whole genome 
alignment file and phylo-HMM. The phylo-HMM analysis 
assumed a conserved and non-conserved state. Modifications 
of parameters were: --target-coverage 0.3, --expected-length 
45, and --rho 0.31. The phylogenetic model for non-conserved 
regions was produced by phyloFit in the PHAST package 
(Hubisz et al., 2011) (http://compgen. cshl. edu/phast/). We 
then used an in-house Python script to extract the CNEs 
from multiple sequence alignments across multiple genomes. 
Our screening required that the CNEs fulfilled three criteria: 
(1) alignments with gaps in at least one of the 24 species 
were deleted in this study; (2) multiple sequence alignments 
overlapped with conserved non-coding regions; and (3) length 
of overlapping sequences exceeded 10 bp. 


Estimates of phylogeny, 
divergence time 

Codon positions 1, 2, and 1+2 of each filtered OOG among 
the 24 species were extracted and concatenated. All CNEs 
were concatenated and trimmed using Gblocks (Castresana, 
2000; Talavera & Castresana, 2007). The maximum likelihood 
(ML) trees using the concatenated genes were reconstructed 
using RAxML v8.1.17 (Stamatakis, 2014) with the GTR+ 
GAMMA substitution model and 1 000 bootstraps. We used 
1 000 rapid bootstrap replicates to assess branch reliability. 
Modeltest (Posada & Crandall, 1998) was selected to detect 
the best substitution model and MrBayes v3.2.6 (Huelsenbeck 
& Ronquist, 2001) was used to reconstruct a Bayesian 
inference (Bl) tree. The chain length was set to 10 000 000 
(10 000 generation/sample), with the first 100 000 samples 
discarded as burn-in. The reconstructed single-gene trees 
with a root (M. musculus) from RAxML v8.1.17 (Stamatakis, 
2014) were applied using maximum  pseudo-likelihood 
estimation of the species tree (MP-EST) (Liu et al., 2010; 
Shaw et al., 2013) with default parameters to estimate a 
species tree based on a multispecies coalescent model. This 
approach was assumed to resolve conflicts between 
concatenated and coalescent species trees if ILS existed in 
our dataset. Relative genomic divergence (nucleotide 


genomic divergence, and 
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Table1 List of taxa, including primates, colugos, tree shrews, and rodents, used in our analyses 


Species 

Homo sapiens 

Pan troglodytes 

Gorilla gorilla 

Pongo abelii 
Nomascus leucogenys 
Macaca mulatta 
Macaca nemestrina 
Papio anubis 
Cercocebus atys 
Mandrillus leucophaeus 
Chlorocebus sabaeus 
Colobus angolensis 
Rhinopithecus roxellana 
Aotus nancymaae 
Callithrix jacchus 
Cebus capucinus 
Saimiri boliviensis 
Carlito syrichta 
Propithecus coquereli 
Microcebus murinus 
Otolemur garnettii 
Galeopterus variegatus 
Tupaia belangeri 


Mus musculus 


Common name Taxonomy 

Human Hominoidea 
Chimpanzee Hominoidea 
Western lowland gorilla Hominoidea 
Sumatran orangutan Hominoidea 
Northern white-cheeked gibbon Hominoidea 


Rhesus monkey 
Pig-tailed macaque 
Olive baboon 
Sooty mangabey 
Drill 

Green monkey 


Angolan colobus 


Golden snub-nosed monkey 


Ma’s night monkey 


White-tufted-ear marmoset 


White-faced sapajou 


Bolivian squirrel monkey 


Philippine tarsier 
Coquerel’s sifaka 
Gray mouse lemur 
Small-eared galago 
Sunda flying lemur 
Northern tree shrew 


House mouse 


substitutions/site) was visualized as branch lengths in the 
concatenated species phylogeny. 

We estimated divergence time using MCMCtree (dos Reis 
& Yang, 2011; Yang, 2007), with five fossil calibration points 
used to determine approximate likelihood calculations (Fan et 
al., 2013; Franzen et al., 2009; Matsui et al., 2009; Poux & 
Douzery, 2004; Vignaud et al., 2002). Using MCMCtree (dos 
Reis & Yang, 2011; Yang, 2007), we obtained 10 000 trees 
with a sampling frequency of 50 and burn-in of the first 10 000 
iterations. The parameters were modified as follows: 'clock=3', 
'model-0', 'alphaz0', 'ncatG=5', 'cleandata=1', 'BDparas=1 1 
0', 'Kkappa gamma-6 2', 'aloha gamma-z1 1', 'rgene gamma- 
2 2', 'sigma2_gamma=1 10', and ‘finetune=1: .00356 0.02243 
0.00633 0.12 .43455'. All other parameters were set to default. 
Tracer (http://beast. bio. ed. ac. uk/Tracer) was used to detect 
convergence. 


RESULTS AND DISCUSSION 


Coding data and phylogeny based on coding partitioned 
models 

We identified 738 OOGs among the 24 species (Table 1). 
After multiple sequence alignments and removal of 
ambiguous sites/regions for each gene, 706 OOGs were 
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Old World monkey 
Old World monkey 
Old World monkey 
Old World monkey 
Old World monkey 
Old World monkey 
Old World monkey 
Old World monkey 
New World monkey 
New World monkey 
New World monkey 
New World monkey 
Tarsiiformes 
Strepsirrhini 
Strepsirrhini 
Strepsirrhini 
Dermoptera 
Scandentia 


Rodentia 


retained. Because rapidly evolving nucleotide sites are 
phylogenetically less informative than slowly evolving ones, 
especially for ancient groups (Kàlersjó et al., 1999), we 
deleted the third codon positions for each OOG. The 
remaining data were concatenated and partitioned by codon 
position as follows: codon position 1 (196 501 bp), position 
2 (196 501 bp), and positions 1+2 (393002 bp). We 
employed RAxML v8.1.17 (Stamatakis, 2014) to determine 
phylogeny based on ML algorithm with using the GTR+ 
GAMMA substitution model. The topologies of the three 
consensus trees potentially resolved the deep-branch 
phylogeny of primates. The representative — colugos 
constituted a sister group of primates, which together 
formed the Primatomorpha. Compared to the concatenated 
codon 1 and codon 2 sequences, the concatenated codons 
1+2 provided greater support for all nodes. Only one node 
did not have 100% bootstrap support (BS) (Figure 1). The 
BI analysis was consistent with the sequence data of the 
concatenated codons 1+2, with high Bayesian posterior 
probability (BPP) for each node. 

Phylogenetic analyses of the three data-partitions were 
concordant with the ML and BI topologies for Primatomorpha. 
However, complicated biological processes, such as ILS, can 
influence the reconstruction of species trees as different 
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Figure 1 Reconstructed phylogenies based on concatenated sequences from first, second, and first-second codon 


respectively, of 706 OOGs 


positions, 


(A) and (B) were based on maximum likelihood and (C) and (D) were based on maximum likelihood and Bayesian inference, respectively. Common 


ancestry branch of primates and colugos was marked by red arrow. 


protein-coding genes may have divergent 
histories during the early diversification of Euarchonta 
(Murphy et al, 2007; Nishihara et al. 2006) Thus, 
concatenated gene sequences may result in a false-positive 
species tree with higher nodal BS or BPP (Kubatko & Degnan, 
2007). Thus, we estimated the best topology of each gene 
tree based on the ML algorithm across 24 species, and 
discovered that discordance between the gene and species 
trees indeed existed in protein-coding genes in our dataset, 
which included 100 gene trees (14.196) supporting tree 
shrews as the closest living relatives of primates and 25 gene 
trees (3.596) supporting both tree shrews and colugos as a 
sister group of primates. However, we identified a higher 
proportion of genes (130 genes, ~18.4%) supporting colugos 
as the closest living relatives of primates (Supplementary 
Figure S1). Multispecies coalescence can resolve 
discordance between gene and species trees, even in the 
presence of ILS (Mirarab et al., 2014a; Song et al., 2012; 
Zhong et al., 2013). Therefore, we used MP-EST (Liu et al., 
2010; Shaw et al., 2013) to construct a coalescent phylogeny. 
The method maximizes a pseudo-likelihood function to infer a 
species tree with 1 000 bootstrap replicates. The topology of 
the coalescent species tree was concordant with the one 
using the concatenated method. The deep node at the split of 
colugos and primates had moderate BS (7796) and the node 
for the last common ancestor of Callithrix jacchus and Aotus 
nancymaae had low BS (55.6%) (Supplementary Figure S2). 
For the phylogeny of primates, the species tree topology 
based on the concatenated data was consistent with that of 


evolutionary 


the coalescent data. 


Species tree based on conserved non-coding elements 

Protein-coding regions only reflect a part of a genome's 
evolutionary story. Most genome sequences are from non- 
coding regions. Previous studies have shown that CNEs have 
important biological functions, including development and 
transcriptional regulation (Mikkelsen et al., 2007; Woolfe et al., 
2005). We used LASTZ (Schwartz et al., 2003) to perform 
pairwise genome alignments with human as the reference, 
and then used MULTIZ v11.2 (Blanchette et al., 2004) to 
obtain whole genome alignments across the 24 species. 
Phastcons (Siepel et al, 2005) identified 215.6 Mb of 
concatenated CNEs, with Gblocks v0.91b (Castresana, 2000; 
Talavera & Castresana, 2007) (parameters: —t=d —b4=5 —b5= 
n) then used to trim the data for high quality. These analyses 
obtained 45.4 Mb of data across all 24 species. The RAxML 
v8.1.17 (Stamatakis, 2014) analyses of the CNEs produced a 
phylogeny using the GTR+GAMMA substitution model and 
1 000 bootstrap replicates. Each of the nodes for the potential 
species tree arrived at a 10096 BS (Figure 2A). This analysis 
also supported the hypothesis that colugos are the closest 
living relative of primates. The tree depicted a very short 
branch length from the last common ancestor of primates and 
colugos, suggesting that the last common ancestor 
experienced rapid radiation. This discovery may explain, at 
least in part, why the sister group of primates is controversial. 
Analyses using mitochondrial and nuclear sequences did not 
obtain this result (Murphy et al., 2001; Springer et al., 2003), 
whereas the globally concatenated data from codons 1+2 plus 
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Figure 2 Maximum likelihood tree (ML) and genomic divergences based on conserved non-coding elements (CNEs) 
A: CNE ML tree. B: Genomic divergences (nucleotide divergence) of primates. Carlito syrichta was excluded because of too few species in 


Tarsiiformes. Differential groups were marked by blue lines. 


CNEs inferred a consistent species tree (Supplementary 
Figure S3). 


CNEs and coding trees vs. InDels and nuclear gene trees 

Compared to limited morphological, mitochondrial, and 
nuclear gene assessments, high-quality whole genomes and 
robust algorithms can identify, alter, and extend previous 
taxonomic conclusions. Our phylogenetic analyses, rooted 
using a rodent (Mus musculus), covered the main branches of 
primates for which genomes exist. Many factors, such as gene 
duplication, admixture, introgression, hybridization, 
recombination, convergent evolution, and natural selection, 
can influence the reconstruction of a true species tree 
(Degnan & Rosenberg, 2009; Rokas et al, 2003; 
Scornavacca & Galtier, 2017). However, ILS has often been 
invoked to explain conflicts between species and gene trees. 
Many phylogenetic studies examining conflicting signals of 
different genes have reported considerable discordance 
across gene trees, e.g., in fruit flies (Pollard et al., 2006) and 
finches (Jennings & Edwards, 2005). Here, our concatenated 
and coalescent models supported the hypothesis that colugos 
are the extant sister group of primates, which together form 
the — Primatomorpha. Concatenated sequences have 
potentially more informative sites. The coalescent model fills 
in discordances between the estimated species tree and gene 
tree once ILS occurs. All methods of analyses in the current 
study considered different data partitions (protein-coding 
genes and non-coding sequences), but stil obtained a 
consistent species tree. Our results were in concordance with 
the hypothesis that colugos are the closest living relatives of 
primates but refuted the hypotheses that tree shrews or tree 
shrews plus colugos are the sister group of primates. These 
observations provide similar insight as two classical molecular 
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studies, which utilized InDels in genomes (Janecka et al., 
2007) and 54 sequenced nuclear genes to infer the primate 
species tree (Perelman et al., 2011). However, our analyses 
still cannot exclude important influences of many other 
complex evolutionary histories, such as ancient admixture, 
introgression, or other potential factors. Recent lines of 
evidence have indicated that ancient gene flows may 
potentially underpin phylogenetic discordances (de Manuel et 
al., 2016; Kuhlwilm et al., 2019; Li et al., 2016; Melo-Ferreira 
et al., 2014; Nater et al., 2017; Rogers et al., 2019; Tung & 
Barreiro, 2017; Wu et al., 2018). Therefore, future population 
genomic studies with deeper coverage and advanced 
methods need to validate this possibility in clades. 


Estimates of genomic divergence in primates 

Primate phylogeny can be used to estimate the relative levels 
of genomic divergences based on branch lengths. The branch 
uniting Homininae and Pongo abelii separated them from 
Nomascus leucogenys, which was assigned to Hylobatidae. 
This short branch length indicated that Hominoidea radiated 
rapidly as ((Ponginae, Homininae), Hylobatidae). Overall, 


primate genomes have varying levels of genomic 
divergences. According to previous studies, gibbons 
(Hylobatidae) have an extremely high proportion of 


chromosomal rearrangement (Jauch et al., 1992; Muller et al., 
2003; Perelman et al., 2011). Our analyses indicated that 
Nomascus leucogenys had the highest rate of genomic 
divergence (32.3 substitutions/sitex10^) among species of 
Hominoidea. This was almost equivalent to the mean 
nucleotide divergence (32.6 substitutions/sitex10^) of all 
primate genomes (Figure 2B). The results of a previous study 
(Perelman et al., 2011) and our research suggest that this 
may be a consequence of genomic rearrangements within 


Nomascus leucogenys; however, this needs additional 
investigation. Our results also showed that Old World 
monkeys had the lowest genomic divergence, and this 
contrasted with their sister group, Hominoidea and other 
primates. Low levels of divergence were specific 
characteristics of Papionini, including Macaca mulatta, 
Macaca nemestrina, Mandrillus leucophaeus, Cercocebus 
atys, and Papio anubis. Speciation patterns of Old World 
monkeys are remarkably misleading due to convergent 
morphological traits, behavior, and reproduction, as well as 
sympatric hybridization (Perelman et al., 2011). Here, the 
rapid speciation and radiations in Papionini may have involved 
in reticulate evolution via natural hybridization (Arnold & 
Meyer, 2006). 


Early evolutionary dates of Primatomorpha and Primates 

The timing and early evolution of Primatomorpha and 
Primates are intriguing. A MCMCtree approach (dos Reis & 
Yang, 2011; Yang, 2007) with five fossil constraints (Fan et al., 
2013; Franzen et al., 2009; Matsui et al., 2009; Poux & 
Douzery, 2004; Vignaud et al., 2002) was used to estimate 
evolutionary dating (Figure 3). According to previous methods 
(Liu et al., 2017; Reis et al., 2018), we generated a plot of the 
posterior distributions to estimate the accuracy of our 
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divergence dating analyses for calibrated nodes, compared to 
a previous primate phylogeny study (Perelman et al., 2011) 
and TimeTree (http://www. timetree. org/) (Supplementary 
Figure S4). Our analyses were consistent with their results 
and thus ensured that our analyses were reliable. Our results 
suggested the origin of Primatomorpha at ~88.0 million years 
ago (Mya), Primates at ~77.4 Mya, Strepsirrhini at ~70.6 Mya, 
and Tarsiiformes at ~65.0 Mya (Figure 3). Previously, groups 
at the base of Euarchonta have been difficult to resolve due to 
the rapid radiation of ancestral lineages before the 
Cretaceous-Tertiary (K/T) boundary (~65 Ma) (Wible et al., 
2007) and limited sampling of genes and taxa (Janecka et al., 
2007; Springer et al., 2003). The estimated dates of origins of 
Primates, Strepsirrhini, and Tarsiiformes are similar to 
previous inferences based on nuclear and mitochondrial 
genes (Jameson et al, 2011; Pozzi et al., 2014). This 
consistency suggests that our estimates are potentially 
reliable. These events predated the K/T boundary (Wible et 
al., 2007). Short branch lengths occurred for the last common 
ancestor of colugos and primates, as well as for Tarsiiformes 
and Simiiformes in the CNE tree (Figure 2A). Such rapid 
divergences may explain why some morphological and 
molecular studies (Bloch et al., 2007; Sargis, 2002) fail to 
resolve the placements of colugos and tarsiers. Analyses also 
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Figure 3 Estimated divergence dates of Euarchonta 
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Dashed vertical line denoted K/T boundary (~65 Mya). To obtain estimated divergence time, five calibration points with letters a, b, c, d, and e were 
applied as normal priors to constrain nodal ages. Five red solid points represented fossil constraints, including direct and indirect fossil records: a: 
mean=6.5 Mya, stdev=0.8 for time to most recent common ancestor (TMRCA) for Homo-Pan (Vignaud et al., 2002); b: mean=15.5 Mya, stdev=2.5 


for TMRCA of Homininae-Ponginae (Matsui et al., 2009); c: mean=29.0 M 


ya, stdev=6.0 for TMRCA of Catarrhini (Poux & Douzery, 2004); d: mean= 


43.0 Mya, stdev=4.5 for TMRCA of Catarrhini-Platyrrhini (Franzen et al., 2009; Poux & Douzery, 2004); e: median=90.9, error range (80.6 - 106.2) 
for TMRCA of Scandentia-Primatomorpha (Fan et al., 2013). All nodal ages were indicated by medians (red font) and 95% highest posterior density 


(HPD) intervals (blue bars). 
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estimated that extant strepsirrhines arose ~53.2 Mya, based 
on three species (Propithecus coquereli, Microcebus murinus 
and Otolemur garnettii), which covered two main groups of 
Strepsirrhini: Lemuriformes and Lorisiformes. 


CONCLUSIONS 


Our analyses provide a potentially robust assessment of the 
closest living relatives and divergence times of living primates. 
The improved conserved non-coding elements and partitioned 
protein-coding sequences will facilitate further comprehensive 
studies into the biological processes related to the genomic 
and phenotypic evolution of primates, especially of humans. 
Furthermore, our analyses provide a useful resource for the 
phylogenetic reconstruction of ancient groups. 
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ABSTRACT 


The systematics of Semisulcospiridae in China is 
revised here based on morphological characters and 
mitochondrial phylogenetics. Phylogenetic 
relationships within the Chinese semisulcospirids 
were assessed via DNA sequences from 
mitochondrial analysis (cytochrome c oxidase | and 
16S rRNA) This research contains most 
morphospecies of  semisulcospirids previously 
recorded in China. Based on these results, the 
family of Chinese Semisulcospiridae is represented 
by three genera: i. e., viviparous Semisulcospira 
Bottger, 1886, oviparous Hua Chen, 1943 and 
Koreoleptoxis Burch and Jung, 1988. These genera 
can be distinguished from each other by 
reproductive anatomy, reproductive mode, and 
radula features. Species of Hua are mainly 
distributed in southwest China and Guangxi, 
whereas Koreoleptoxis and Semisulcospira are 
mainly distributed in south and northeast China. 


Keywords: Phylogeny; China; Semisulcospira; Hua; 
Koreoleptoxis 
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INTRODUCTION 
Semisulcospirids are an  ecologically important and 
taxonomically challenging family of freshwater  snails 


distributed in lakes, rivers, and streams in eastern Asia, 
including Far East Russia, Japan, Korean peninsula, and 
China, and in western North America (Kohler, 2016, 2017; Liu 
et al, 1993; Strong & Köhler, 2009; Yen, 1939). Basic 
knowledge on freshwater gastropods in China was first 
established in the late nineteenth and early twentieth 
centuries (e.g., Dautzenberg & Fischer, 1906, 1908; Fulton, 
1914; Heude, 1888), with all cerithioidean freshwater 
gastropods designated under the single genus Melania 
Lamarck, 1799 (junior synonym of Thiara Róding, 1798). 
Heude (1888, 1889 (1882 — 1890)) recorded 24 species of 
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Melania in south China, especially in the Yangtze River Basin. 
Yen (1939) subsequently identified Chinese freshwater snails 
from the Naturmuseum Senckenberg, Germany, and recorded 
12 Chinese species of Thiaridae and a new genus, 
Senckenbergia Yen, 1939, for the type species Melania 
pleuroceroides from Sichuan. Chen (1943) later described two 
genera, Wanga (type species Melania henriettae Gray, 1833) 
and Hua (type species Melania telonaria Heude, 1888), into 
which were placed eight and eleven nominal species, 
respectively. In dispute, Morrison (1954) proposed Wanga and 
Hua to be synonyms of Brotia H. Adams 1866 and Oxytrema 
Rafinesque 1819, respectively, although the suggestion of 
Hua being a synonym of Oxytrema was not accepted by 
subsequent authors (Du et al., 2019; Kóhler, 2016, 2017; 
Strong & Kóhler, 2009). Abbott (1948) described a new genus, 
Namrutua, for the type species Melania ningpoensis. 
However, Senckenbergia and Namrutua were primarily 
ignored by most Chinese researchers, with all species instead 
placed in Semisulcospira Böttger, 1886 (Liu et al., 1979, 1993; 
Xu, 2007; Zhang et al, 1997) Several species of 
Semisulcospira were subsequently described in China, 
including S. inflata Tchang & Tsi 1949 recorded in Lake 
Dianchi, Yunnan, and S. marica Li, Wang & Zhang 1994 and 
S. crassicosta Liu, Wang & Zhang 1994 recorded in Guizhou 
Province (Liu et al., 1994; Tchang & Tsi, 1949). Liu et al. 
(1994) also descried two new species of Paludomus, P. 
gianensis Liu Zhang & Duan 1994 and P cinctus Liu, Zhang & 
Duan 1994, in Guizhou Province. Xu (2007) stated that there 
were 53 species of Semisulcospiridae found in China, 
including 49 species of Semisulcospira, two species of Hua, 
one species of Paludomus, and one species of Potodoma. 
However, Xu (2007) did not revise the validity of these 
species. The first systematic revision of semisulcospirids was 
provided by Strong & Kóhler (2009) based on morphological 
characters and molecular phylogenetic analysis. They showed 
Hua to be a valid genus and distinguishable from 
Semisulcospira by distinct features of the digestive, 
reproductive, and nervous systems. Du et al. (2017, 2019) 
designated Semisulcospira trivolvis Yen 1939 as synonym to 
S. paludiformis Yen 1939 and further stated that it should be 
assigned to the genus  Sulcospira Troschel, 1858 
(Pachychilidae Fischer & Crosse, 1892). More recently, Du et 
al. (2019) studied semisulcospirids of southwest China by 
morpho-anatomy and mitochondrial phylogenetics and 
supported Namrutua and Senckenbergia as viviparous and as 
synonyms of Semisulcospira. Additionally, the taxonomic 
status of several species were discussed, in particular M. 
cancellata Benson 1842 and M. suifuensis Chen 1937 were 
treated as junior synonyms to S. ningpoensis (|. Lea, 1856), 
and M. dulcis Fulton 1904, M. lauta Fulton 1904, and S. inflata 
Tchang 1949 were treated as junior synonyms to Hua textrix 
(Heude, 1888). However, several semisulcospirid species 
were not contained in Du et al. (2019), including H. 
kweichouensis (Chen, 1937) (synonym to H. intermedia 
(Gredler, 1885) in this study), Paludomus minensis Chen 1943 
(synonym to Lithogryphus pallens Bavay & Dautzenberg 
1910, valid species name is H. pallens in this study), and P. 
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rotundata (Heude, 1888). Furthermore, most semisulcospirids 
were recorded in the Yangtze River. Thus, based on previous 
literature, 52 species of semisulcospirids have been recorded 
in China (Du & Yang, 2019; Du et al., 2019; Liu et al., 1993; 
Xu, 2007). However, the systematic and phylogenetic 
relationships of semisulcospirids remain poorly understood. 
From 2014 to 2017, specimens of semisulcospirids were 
collected from 14 Chinese provinces, including Heilongjiang, 
Jilin, Liaoning, Anhui, Jiangxi, Hunan, Zhejiang, Fujian, 
Guangdong, Guangxi, Guizhou, Sichuan, Chongqing, and 
Yunnan. Based on morphological and molecular analyses, the 
specimens were used to clarify the phylogenetic and 
systematic relationships among Chinese semisulcospirids. 


MATERIALS AND METHODS 


Taxon sampling 

Specimens were collected from 2014 to 2017 by L.N. Du et al. 
in 14 Chinese provinces (Figure 1). The foot muscle tissues 
were preserved in 99% ethanol. All voucher specimens were 
deposited at the Kunming Institute of Zoology, Chinese 
Academy of Sciences (KIZ, CAS) (Table 1). The identification 
of specimens included verification of origin descriptions and 
topotypic specimen photos. 

We examined 225 sequenced samples from 41 species, 
including 95 sequenced samples from 22 species of Hua from 
Yunnan, Chongqing, Guangxi, and Guizhou, 90 sequenced 
samples from 12 species of Koreoleptoxis from Heilongjiang, 
Jilin, Hunan, Jiangxi, Fujian, Zhejiang, and Anhui, and 40 
sequenced samples from seven species of Semisulcospira 
from Hunan, Jiangxi, Guangxi, Chongqing, Zhejiang, and 
Liaoning (Table 1). 


Morphology 

Measurements and descriptions, including shell dimensions, 
radulae, and shell features, followed Strong & Kohler (2009) 
and Du et al. (2019). Radulae and embryonic shells were 
cleaned enzymatically with proteinase K for 30 min, after 
which they were sonicated for 60 s, mounted on aluminum 
specimen stubs with adhesive tabs, and coated with gold 
palladium for analysis via scanning electron microscopy 
(Hitachi S-3000, Japan) at 15 kV. Soft body anatomy was 
observed using a microscope (Leica S6D, 6.3—40x) and was 
described following the terminology used by Strong & Kóhler 
(2009) and Kóhler (2017). 


Molecular analyses 

Genomic DNA was purified via standard phenol-chloroform 
isolation and ethanol precipitation. Partial sequences of 
mitochondrial cytochrome c oxidase subunit | (CO/, 772 bp) 
and 16S ribosomal RNA (16S, 856 bp) were amplified using 
primers F2 and R2 (Du et al., 2019) for CO/, and 16S-F 
(Wilson et al., 2004) and H3059 (Palumbi et al., 1991) for 
16S. Homologous sequences of other semisulcospirids were 
obtained from GenBank, with all new sequences deposited in 
same under accession Nos. MK944096-MK944245, 
MK968961-MK969107, MK991781—MK991783, MN013828- 
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Figure 1 Sampling localities of semisulcospirids in China 
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MN013830 (Table 1). We also acquired various sequences 
from GenBank for use as the ingroup following Du et al. 
(2019), including one Koreoleptoxis species (K. amurensis), 
three Semisulcospira species (S. pleuroceroides, S. gredleri 
(Bóttger, 1886), S. ningpoensis), and thirteen Hua species (H. 
aristarchorum (Heude, 1888), H. telonaria (Heude, 1888), H. 
aubryana, H. bailleti (Bavay & Dautzenberg, 1910), H. textrix, 
H. vultuosa (Fulton, 1914), H. scrupea (Fulton, 1914), H. 
funingensis Du, Kóhler, Yu, Chen & Yang, 2019, H. 
kunmingensis Du, Kóhler, Yu, Chen & Yang, 2019, H. liuii Du, 
Kóhler, Yu, Chen & Yang, 2019, H. tchangsii Du, Kóhler, Yu, 
Chen & Yang, 2019, and two undescribed) (MK251600 — 
MK251768)); Elimia clenchi, E. catenaria, and Melanopsis sp. 
were also obtained from GenBank as the outgroup to root the 
phylogeny in accordance with the trees in Strong & Kóhler 
(2009) and Kóhler (2017). 

Sequence alignment of 16S rRNA and CO! was performed 
with default parameters using Mafft v7.305 (Katoh et al., 2002) 
and the CIPRES Science Gateway (Miller et al., 2010). 
Pairwise distances between species were determined using 
MEGA 7 (Kumar et al., 2016) and the optimal substitution 
model was selected by corrected Akaike Information Criterion 
using jModeltest v2.1.10 (Posada, 2008). MRBAYES v3.2.6 
(Ronquist et a/., 2012) and RAxML-HPC v.8.2.10 (Stamatakis, 
2014) were used for Bayesian inference and maximum 
likelihood analyses, respectively, in reference to the selected 
model of sequence evolution. Bayesian posterior probabilities 
(BPPs) of nodes were determined using Metropolis-coupled 
Markov chains (one cold chain) for five million generations, 
with sampling every 100 generations. Node support for 
maximum likelihood analysis was determined using 1 000 
rapid bootstrap replicates. 


RESULTS 


Sequence analysis and mitochondrial phylogeny 

The final concatenated dataset contained 221 16S sequences 
and 220 COI sequences. The 16S alignment contained 567 
nucleotide positions after pruning alignment ends and 
removing poorly aligned portions. The CO/ dataset had 762 
nucleotide positions and a total length of 1 339 bp, including 
553 variable sites and 497 phylogenetically informative sites. 
The GTR+tI+G model was selected as the best-fit model of 
nucleotide substitution. The dataset contained sequences 
from previously published studies, which were obtained from 
GenBank, as well as the new sequences, which were 
deposited in GenBank (accession Nos. shown in Table 1). 
Sequences of Western North America Juga, type species of 
the genera Semisulcospira libertina and Koreoleptoxis globus 
ovalis, and other published sequences of K. nofifila and S. 
reiniana from GenBank were included to check the 
phylogenetic positions of the Chinese semisulcospirids. 

Both phylogenetic analyses produced identical, usually well- 
supported topologies consisting of two main clades (A and B), 
which contained sequences of various nominal species 
previously recorded in China. Clade A could be divided into 
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four subclades: subclade A1 contained species P. rotundata, 
P. minensis, and L. pallens; subclade A2 contained one 
undescribed species from Nanpanjiang River, Hua sp. 3, H. 
aristarchorum, and species of Wanga as orphans without 
genus affiliation from southwest China and Guangxi Zhuang 
Autonomous Region; subclade A3 contained H. funingensis 
and an undescribed species, Hua sp. 2, from Nanpanjiang 
River; subclade A4 contained H. telonaria (type species), 
species of Hua from southwest China and Vietnam, and two 
undescribed species (Figure 2). Clade B could be divided into 
three subclades; subclades B1 and B2 contained species of 
viviparous Semisulcospira from Japan and China; subclade 
B3 contained oviparous K. globus ovalis, K. nofifila, and K. 
amurensis, and species from the lower Yangtze River (Figure 
3). These clades revealed an approximate phylogeographic 
structure, which roughly adhered to the major drainage 
Systems in China (Figure 1). Results indicated that clade A 
(black) occurred predominantly in southwest China and 
drainages of the upper Yangtze and Pearl rivers, whereas 
subclades B2 (red) and B3 (blue) mainly occurred in the 
middle and lower Yangtze River, with a few specimens 
collected from northeast China (Figure 1). 


Comparative morphology 

We identified several patterns largely consistent with the 
phylogenetic clustering in the mitochondrial trees shown in 
Figure 2 and Figure 3. Subclades B1 and B2 are viviparous 
and mature females do not possess an ovipositor pore or egg- 
laying groove below the right cephalic tentacle; furthermore, 
the lower margin of the central teeth is straight in both 
subclades. Clade A and subclade B3 are both oviparous, 
but show differences in reproductive anatomy; specifically, 
mature females in clade A possess an ovipositor pore or 
egg-laying groove below the right cephalic tentacle, whereas 
mature females in subclade B3 possess both an ovipositor 
and egg-laying groove. In addition, the lower margin of the 
central teeth is w-shaped in clade A, but triangular in 
subclade B3. 

Regarding operculum features, the nucleus is obvious and 
located in the lower base of the operculum in all subclades, 
except subclade A1, in which it is not obvious (Figure 4A-B). 
In subclade A3, the nucleus is very small and located in the 
lower one sixth of the operculum (Figure 4C-D); in subclades 
A2 and A4 and clade B, the operculum shows a clear spiral 
growth line and the nucleus is located in the lower one third of 
the operculum (Figure 4E-H). 

In clade A, species of subclades A1 and A4 (except H. 
aubryana and H. jacqueti) as well as H. sp. 3, and H. 
funingensis, possess smooth shells, whereas all other species 
possess a sculptured shell. In P. minensis, L. pallens, H. liuii, 
and H. funingensis, the lateral teeth exhibit a single prominent 
squarish cusp without small denticles at either side (Figure 
5B), whereas, in all other species, the lateral teeth exhibit two 
to four small denticles at each side (Figure 5A). In P. minensis, 
P. rotundata, L. pallens, H. sp. 3, and H. textrix, the mature 
females possess an egg-laying groove, whereas mature 
females in all other species possess an oviparous pore. In 
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Colored circles indicate origin of samples as indicated on map in Figure 1. 
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Figure 3 Bayesian phylogram of clade B based on analysis of concatenated dataset of mitochondrial cytochrome c oxidase subunit | 
(CON and 16S rRNA sequences. 


Colored circles indicate origin of samples as indicated on map in Figure 1. 
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Figure 4 Opercula of semisulcospirids from China 


A: Hua pallens; B: H. rotundata; C: H. funingensis; D: H. sp.2; E: H. telonaria; F: Semisulcospira pleuroceroides; G: N. ningpoensis; H: K. 


amurensis. Scale bar: 1 mm. 


subclade B2, S. ningpoensis, S. gredleri, and S. cinnamomea 
(Gredler, 1887) have well-sculptured shells, whereas all other 
Chinese species possess a smooth shell. In subclade B3, K. 


amurensis and Melania pacificans Heude, 1888 have well- 
sculptured shells, whereas all other Chinese species have a 
smooth shell. 





Figure 5 Radulae of semisulcospirids from China 
A: Hua telonaria; B: H. funingensis; C: Koreoleptoxis; D: Semisulcospira. 


Mitochondrial differentiation 

Pairwise comparisons of CO! distances revealed that clade A 
and subclades B2 and B3 differed from each other by average 
uncorrected P-distances of 14.6% to 14.7% (average 14.6%). 
Species within clade A differed by an average uncorrected P- 
distance of 0.896 to 14.596 (average 11.296). Species within 
subclade B2 differed by an average uncorrected P-distance of 
10.0% to 14.9% (average 12.0%). Species within subclade B3 
differed by an average uncorrected P-distance of 1.296 to 
11.2% (average 6.6%). 


Scale bar: 100 um. 


DISCUSSION 


In this study, the Chinese semisulcospirids were divided into 
two main clades, i.e., clades A and B. Minton & Lydeard 
(2003) reported maximum intergeneric differences of 15.5% 
for COI in Semisulcospiridae. Here, we observed a maximum 
intergeneric divergence of 14.796. Hence, clade A, together 
with subclades B2 and B3, represented the genera of Hua, 
Koreoleptoxis, and Semisulcospira, respectively. 
Morphologically, the three genera can be distinguished from 
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each other by reproductive mode, radula, and female 
reproductive features. 

Liu et al. (1993) and Xu (2007) assigned Senckenbergia 
pleuroceroides and Namrutua ningpoensis into 
Semisulcospira, but did not provide a reason for this decision. 
Du et al. (2019) treated Namrutua and Senckenbergia as 
junior synonyms to Semisulcospira based on morphological 
and phylogenetic evidence. In addition to three Chinese 
species, i. e, S. gredleri, S. ningpoensis, and S. 
pleuroceroides, four other species, i. e., S. cinnamomea 
(Gredler, 1887) from Yangtze River, S. calculus (Reeve, 1860) 
from Liaoning Province in northeast China, and two 
undescribed species from Zhejiang Province were contained 
in this study. Semisulcospira cinnamomea was initially 
described as a subspecies of S. gredleri (Gredler, 1887); 
however, it is treated as a valid species in this study as it can 
be distinguished from S. gredleri by its six to nine whorls (vs. 
four to five) and body whorl not inflated (vs. inflated). 
Furthermore, M. calculus and M. paucicincta Martens 1894 
are virtually identical (shell size, proportion, rate of whorl 
expansion, and color bands), and are here treated as 
synonyms, with Semisulcospira calculus considered as the 
valid species name due to the description of M. calculus 
predating that of M. paucicincta. The Chinese species of 
Semisulcospira can be distinguished from each other by their 
shell features (sculptured or smooth, body whorl inflated or 
not, number of whorls) and radulae. 

Several species were originally described as members of 
Paludomus, including P. Kkweichouensis, P. minensis, P. 
qianensis, and P. cinctus (Chen, 1937, 1943; Liu et al., 1994). 
Although the Chinese Paludomus species are similar to P. 
conica Gray (type species of Paludomus, type location in 
India) in operculum features, Abbott (1948) stated that the 
mantle margin in P. conica contains 20 to 25 papillae, whereas 
the mantle margin in Chinese Paludomus is smooth. 
Furthermore, in the current study, phylogenetic evidence 
indicated that Chinese Paludomus species recorded in 
southwest China could belong to Hua (Semisulcospiridae). In 
addition, P. minensis and L. pallens, which come from the 
same locality, are virtually identical, except that the body whorl 
in L. pallens is more inflated than that in P. minensis; thus, 
here, they are deemed to be synonyms, with Hua pallens 
considered as the valid species name as the description of L. 
pallens predates that of P.  minensis. Furthermore, P 
kweichouensis is treated as a synonym of M. intermedia 
Gredler, 1884 due to their virtually identical shells and same 
locality, with Hua intermedia considered as the valid species 
name due to the description of M. intermedia predating that of 
P. kweichouensis. 

Chen (1943) originally described Wanga (containing eight 
species) and Hua (containing eleven species). In this study, 
however, the species of Wanga and Hua from southwest 
China and Guangxi are placed in Hua, which therefore 
contains 19 species in China, i. e., H. aristarchorum, H. 
aubryana, H. bailleti, H. funingensis, H. intermedia, H. 
jacqueti, H. kunmingensis, H. liuii, H. pallens, H. rotundata, H. 
scrupea, H. textrix, H. telonaria, H. tchangsii, H. vultuosa, and 
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four undescribed species (Du et al., 2019); other species from 
south of the middle and lower Yangtze River are placed into 
Koreoleptoxis, which therefore contains K. diminuta (Boettger, 
1887), K. friniana (Heude, 1888), K. joretiana (Heude, 1890), 
K. naiadarum (Heude, 1890), K. oreadarum (Heude, 1888), K. 
parenotata (Gredler, 1884), K. schmackeri (Boettger, 1886), 
and K. toucheana (Heude, 1888). 

Within Hua, species can be distinguished from each other 
by the shell (sculptured or smooth, elongated or ovate), 
operculum, radula, and female reproductive features. 
Additionally, within — Koreoleptoxis, species can be 
distinguished from each other by the shell (sculptured or 
smooth, elongated or ovate) and radula features. Although K. 
pacificans and K. terminalis are not monophyletic in the 
phylogenetic tree, K. pacificans can be distinguished from K. 
terminalis by morphological characteristics: i.e., shell solid 
and sculptured (vs. thin and smooth) and same number of 
inner and outer margin teeth (vs. different). Hence, K. 
pacificans and K. terminalis are treated as valid species in 
this study. Incongruence between mtDNA phylogeny and 
morphospecies is not a rare phenomenon in 
semisulcospirids (Kóhler, 2016, 2017). Retention of ancestral 
polymorphisms, introgression, heteroplasmy, recombination, 
and the presence of morphologically cryptic species or 
taxonomic misidentification may explain why mtDNA 
phylogenies do not accurately reflect the relationships of 
their host organisms (Ballard & Whitlock, 2004; Edwards & 
Beerli, 2000; Funk & Omland, 2003; Köhler & Deein, 2010; 
Kóhler, 2016; White et al., 2008). 
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ZOOLOGICAL RESEARCH 


Sinocyclocheilus sanxiaensis, a new blind fish from the 
Three Gorges of Yangtze River provides insights into 
speciation of Chinese cavefish 


DEAR EDITOR, 


A blind fish of Sinocyclocheilus (Cypriniformes: Cyprinidae) 
was caught in open water in the Three Gorges (Sanxia) 
reservoir, at a depth of 20 m in the mainstream of Yangtze 
River in Zigui County, Hubei Province, China. This fish can be 
easily distinguished from all other congeners by external 
morphological characteristics, and is estimated to have 
diverged from its sister group about 0.55 million years ago 
(Ma). The geologically well separated locality of this species 
has expanded the distribution of Sinocyclocheilus cavefish 
from around N25° (latitude) to above N30? . Herein, we 
describe this new species as Sinocyclocheilus sanxiaensis 
sp. nov., and discuss the possible reasons why the species 
appears, surprisingly, in the Three Gorges reservoir. 

Cavefish, or hypogean fish, are a special group of aquatic 
organisms that spend much (or all) of their lives restricted to 
cave or subterranean river habitats (Ma et al., 2019). Many of 
these fishes possess distinct and highly specialized 
morphological troglomorphisms, such as rudimentary eyes 
and scales and loss of pigmentation (Yang et al., 2016). China 
has the highest number of cavefish in the world, with more 
than 150 species mainly distributing in the vast karst areas in 
southwestern China (Ma et al., 2019). 

Cavefish belonging to Sinocyclocheilus are endemic, and 
also the most speciose cavefish group in China. Members of 
this genus are characterized by their compressed bodies, well- 
developed cephalic lateralis systems, two pairs of barbels, 
and especially, the extensively varied troglomorphisms. So far, 
all species of Sinocyclocheilus have been found only in 
Yunnan,  Guizhou provinces, and Guangxi Zhuang 
Autonomous Region, within a very narrow distribution area 
(about 270 000 km?) at ~ N25° latitude (Zhao & Zhang, 2009). 
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Sinocyclocheilus species diversity was systematically 
reviewed by Zhao & Zhang (2009), who recognized a total of 
49 valid species up to that date. However, the past ten years 
have witnessed many new Species discoveries 
(Supplementary Figure S1), with most found in relatively 
inaccessible or formerly unexplored subterranean rivers. 
Alongside with the description of the new species, we briefly 
review the taxonomy and distribution of Sinocyclocheilus, 
which corroborates the high Sinocyclocheilus species diversity 
with an update of 75 valid species (Supplementary Table S1). 

The type specimen of S. sanxiaensis was deposited in the 
Kunming Natural History Museum of Zoology, Chinese 
Academy of Sciences (KNHM). Measurements and counts 
were taken point-to-point with a digital caliper (0.1 mm 
precision) on the left side of the fish whenever possible. To 
facilitate comparisons to congeners, morphometric and 
meristic characters were selected following Zhao et al. (2006). 
Morphological data from congeners described before 2009 
were taken from the review of Zhao & Zhang (2009), with 
checks of the original literature for any imprecise descriptions 
or discrepancies. For congeners described since 2009, we 
checked original descriptions for morphological comparisons. 
For morphologically similar species, we also examined 
specimens deposited in KNHM. 

To clarify the phylogenetic position of the new species, and 
to estimate divergence time from its putative sister species, 
mitochondrial cytochrome b (cyt b) was sequenced and 
analyzed. DNA was extracted from the pectoral fin on the right 
side of the fish. Primer pairs were selected according to Zhao 
et al. (2006). The cyt b sequences of other species in 
Sinocyclocheilus were downloaded from GenBank, with one 
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representative sequence selected for each species for the 
following phylogenetic analyses. Sequences were aligned 
using ClustalW in MEGA v7 (Kumar et al., 2016), and then 
inferred the phylogenetic relationship and estimated 
divergence time using BEAST v2.4.6 (Bouckaert et al., 2014). 
As there are no suitable fossil records for Sinocyclocheilus, 
we estimated divergence time using a "standard clock" that 
roughly assumed an evolutionary rate of 2% per million years 
for mitochondrial DNA, which has been broadly applied in fish 
previously (Gharrett et al., 2001). 


Taxonomy 

Sinocyclocheilus sanxiaensis sp. nov. Jiang, Li, Yang & 
Chang 

Figure 1A-E; Table 1. 


Holotype: KNHM 2019000001, 164.1 mm standard length 
(SL), 194.6 mm total length (TL); China: Hubei Province, Zigui 
County, Guojiaba Town; near the north bank of the Three 
Gorges (Sanxia) reservoir, mainstream of Yangtze River (N30? 
58'09.92", E110?44' 16.14"; about 144 m a.s.l.); caught on 28 
February 2019 by Jie Li. Tissue of the pectoral fin (right side) 
was taken for DNA sequencing. 


Diagnosis: This species can be distinguished from all other 
congeners by the following combination of characters: (1) 
albinotic body without pigmentation, (2) absence of normal 
eyes, (3) absence of dotted eyes, (4) absence of forehead 
horn, and (5) normal fusiform body with only slight uplift of 
back along and behind head, and two pairs of very short 
barbels (see more details in Table 1). 


Description: Body compressed. Dorsal profile convex, ventral 
profile slightly concave, tapering gradually toward caudal fin. 
Greatest body depth slightly anterior to dorsal fin insertion. 

Head compressed with no eyes. Eye orbit filled with fat 
tissue. Nares at 1/3 between snout tip and anterior margin of 
orbit. Anterior nares possessing anterior rim with posterior 
fleshy flap forming half-tube. Mouth subterminal, with slightly 
projecting upper jaw. Two pairs of very short barbels: maxillary 
barbel insertions in front of anterior nares, not extending to 
nares; rictal barbels slightly longer, but not reaching anterior 
margin of orbit. Gill opening large, opercular membranes not 
connected at isthmus. Joint of dentary-angular not close to 
each other at isthmus. Gill rakers developed, seven on first gill 
arch. 

Pectoral fin long (2496 SL), extending posterior to pelvic fin 
insertion and reaching to 1/3 of pelvic fin. Pelvic fin insertion 
slightly anterior to vertical through dorsal fin insertion, nearer 
to pectoral than to anal fin insertions; pelvic fin extending just 
beyond anus to anal fin insertion. Dorsal fin origin at 5896 
distance from snout tip to caudal-fin base and slightly 
posterior to vertical through pelvic fin insertion. Last 
unbranched spine of dorsal fin hard with serrations along 
posterior edge. Anal fin 1796 SL, with insertion nearer to pelvic 
fin origin than to caudal fin base. Caudal fin bifurcate. 

Lateral line complete and curved, descending greatly from 
posterior margin of operculum to point above pelvic fin 
insertion, descending slightly to point above anal fin insertion, 


then extending horizontally to end of caudal peduncle. Body 
covered by small soft scales, irregularly degenerated. Scales 
not easily visible to naked eye, partially embedded into 
epidermis. Lateral line scales similar in size to other scales. 
Lateral line scale: 41; scale row counts above lateral line, 
below lateral line, and circumpeduncular scales 16, 10, 16, 
respectively. 


Coloration: In life, body generally white, with slightly flesh 
pink on posterior part (after dorsal fin); red block visible on 
lateral head, caused by red gills penetrating at posterior part 
of operculum. Holotype was fixed in 1096 formalin and then 
preserved in 90% alcohol. In preservation, body generally light 
yellow; posterior part of operculum and all fins partially 
transparent. 


Distribution and habitat: Known only the holotype. Although 
specimen was caught using a cage (without bait) in open 
water at a depth of 20 m, the birthplace (or real habitat) was 
still uncertain as it was a typical cavefish. The reservoir was 
gradually formed after the closure of the Three Gorges Dam in 
1997. There are many karst landforms in the Three Gorges 
area. On 3-4 July 2019, we interviewed several local people, 
who described many caves and subterranean rivers near the 
type locality. One local had previously been told that blind fish 
existed in one cave. However, all caves mentioned were 
underwater according to our on-site survey. 


Etymology: The name of the new species, sanxiaensis, is de- 
rived from the Chinese name of the Three Gorges, = (Sanxia). 


Phylogenetic position and divergence time: Phylogenetic 
reconstruction showed that S. sanxiaensis is the sister 
species of the subclade S. cyphotergous + S. multipunctatus, 
with strong support (1.0 Bayesian posterior probability, Figure 
1F) The P-distances between S. sanxiaensis and S. 
cyphotergous, between S. sanxiaensis and S. multipunctatus 
were 1.5% and 1.3%, respectively (see more in 
Supplementary Table S2). These three species then grouped 
with S. macrolepis to form a major clade. Divergence time 
based on the constant evolutionary rate of cyt b indicated that 
S. sanxiaensis separated from S. cyphotergous + S. 
multipunctatus at ~0.55 Ma (0.36—0.77, 95% highest posterior 
densities (HPDs)) and these three species separated from S. 
macrolepis at 4.86 Ma (4.15—5.57, 9596 HPDs). Additionally, 
divergence time estimation showed the origin of 
Sinocyclocheilus to be 7.34 Ma (6.61—8.06, 95% HPDs). 


Remarks 

Phylogeny and origin of S. sanxiaensis 

Although the Chinese cavefish in Sinocyclocheilus exhibit very 
high species diversity, our understanding of how these 
species evolved is far from resolved. This has been restricted 
due to a lack of well-resolved phylogenetic relationships. In 
combination of morphological characters and molecular cyt b 
sequence, Zhao & Zhang (2009) proposed a four species 
group hypothesis, named by the representative species in 
each group: i. e. S. ji, S. angularis, S. tingi, and S. 
cyphotergous, and characterized by different morphological 
features and distribution in specific regions. Based on all 
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Figure 1 Holotype, phylogenetic position and distribution of S. sanxiaensis 

Pictures show lateral view (A), X-ray lateral view (B), lateral head (C), dorsal head (D), and ventral head (E) of the holotype; Phylogenetic position of 
S. sanxiaensis (red) with the estimated divergence time in Sinocyclocheilus (F); Distributions of S. sanxiaensis (red, No. 75) and all other congeners 
(G). The abbreviations of rivers and numbers of other species (No. 1—74) are given according to Supplementary Table S1. 
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Table 1 Meristics and morphometrics of S. sanxiaensis and its diagnosis relative to all other congeners 





























Item Value Diagnosis 

Dorsal fin iii,8 

Anal fin iii,5 

Pectoral fin 1,14 

Pelvic fin L7 

Total vertebrae 34 (1) Distinguished from most congeners (55) by presence of albinism (loss of pigmentation in the 

Predorsal vertebrae 5 skin) vs. non-albinism, except for 19 species: S. altishoulderus, S. anatirostris, S. anophthalmus, S. 
anshuiensis, S. aquihornes, S. bicornutus, S. brevis, S. convexiforeheadus, S. flexuosdorsalis, S. 

Caudal vertebrae 18 furcodorsalis, S. hugeibarbus, S. hyalinus, S. jiuxuensis, S. lingyunensis, S. macrophthalmus, S. 

Gill rakers 7 mashanensis, S. microphthalmus, S. tianlinensis, and S. xunlensis. 

Lateral line scales 41 

Scale row above lateral line 16 

Scale row below lateral line 10 

Circumpeduncular scale 16 

Standard length (mm) 164.1 

In % of standard length 

Body depth 28.5 
(2) Distinguished from S. lingyunensis and S. macrophthalmus by absence of eyes vs. normal eyes. 

Predorsal length 58.0 

Dorsal fin base length 15.7 

Dorsal fin length 21.6 

Preanal length 73.3 

Anal fin base length 8.7 

Anal fin length 17.1 (3) Distinguished from S. altishoulderus, S. bicornutus, S. brevis, S. hugeibarbus, S. jiuxuensis, S. 

Prepectoral length 34.2 mashanensis, and S. microphthalmus by absence of eyes vs. dotted eyes (reduced but visible). 

Pectoral fin base length 4.4 

Pectoral fin length 24.1 

Prepelvic length 54.8 

Pelvic fin base length 4.4 

Pelvic fin length 17:7 

Caudal peduncle length 18.6 (4) Distinguished from S. anatirostris, S. anshuiensis, S. aquihornes, S. convexiforeheadus, S. flex- 
uosdorsalis, S. furcodorsalis, S. hyalinus, and S. tianlinensis by absence of forehead horn vs. fore- 

Caudal peduncle depth 11.6 head horn present (different types). 

Head length 33.6 

Head depth 20.1 

Head width 17.4 

Snout length 16.4 

Eye diameter 7.3 

Interorbital width 10.7 

Prenostril length 7.1 (5) Distinguished from S. anophthalmus and S. xunlensis by only slight uplift of back along and after 

Width between posterior nostrils 7.0 head (vs. rapid) and very short barbels (vs. long). In addition, Sinocyclocheilus sanxiaensis can be 
further distinguished from S. anophthalmus by fewer vertebrae (34 vs. 38-39) and fewer lateral line 

Upper jaw length 8.9 scales (41 vs. 48—59); and from S. xunlensis by normal and subterminal mouth (vs. duckbilled 

Lower jaw length 8.3 mouth). 

Mouth width 9.7 

Maxilla barbel length 4.2 

Rictal barbel length 6.5 
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available cyt b sequences from GenBank, we reconstructed a 
phylogenetic tree from 40 species (Figure 1F). Although the 
phylogenetic relationship differed slightly from that of Zhao & 
Zhang (2009), Sinocyclocheilus sanxiaensis would be a 
member in the "S. cyphotergous group" according to its 
phylogenetic position. 

The sister relationship of S. sanxiaensis and (S. 
cyphotergous + S. multipunctatus) is understandable as S. 
multipunctatus is the only species that has been recorded as 
far north as the Yangtze River (specimens have been reported 
in Wujiang, a tributary of the Yangtze River in Zunyi County of 
Guizhou Province). The divergence time between S. 
sanxiaensis and its sister group was estimated to be very 
young (0.55 Ma) relative to the origin time of Sinocyclocheilus 
(7.35 Ma, Figure 1F). The complete channelization of the 
Yangtze River is generally thought to have formed by the 
lower reach capturing the middle and then the upper reaches, 
which was associated with the stepwise growth of the Qinghai- 
Tibetan Plateau (Clark et al., 2004; Zhang et al., 2018; Zheng 
et al., 2013). However, the start of the Yangtze throughflow 
and the key point of the incision of the Three Gorges have 
variously been argued to postdate 0.75 Ma at one extreme 
and to be as early as 40—45 Ma at the other (Zheng et al., 
2013). If the speciation of S. sanxiaensis (~0.55 Ma) was the 
direct product of the formation of the Three Gorges, the time 
of the incision of the Three Gorges would be very late, as 
proposed recently (0.3-0.12 Ma, Zhang et al., 2018). However, 
we cannot exclude the greater possibility of an occasionally 
regional capture scenario later than the incision of the Three 
Gorges: some northward tributary of the Wujiang River 
captured a previously southward tributary of the Pearl River, 
which could have possibly transported the ancestor of S. 
sanxiaensis from the Pearl River into the Wujiang and also 
into the Yangtze River basin. This could have happened 
anytime because of the well-developed subterranean rivers in 
the board areas between Guizhou and Guangxi, which would 
withdraw any direct links to the time of incision of the Three 
Gorges. 


Possible reason for occurrence of S. sanxiaensis 

Our discovery does raise the question as to how the new 
species arrived in the Three Gorges reservoir of the 
mainstream of the Yangtze River. Given the massive karst 
landforms and abundant underground water systems found 
within the Three Gorges area (Xia et al., 2010), we advocate 
that the new species likely came from a subterranean river 
draining into the Yangtze River. The rise of water after the 
impoundment of the Three Gorges Dam may have facilitated a 
more direct link between some subterranean rivers and the 
Yangtze River. Not only did the dam increase water height and 
coverage area, it also slowed the velocity of water flow. 
Interestingly, there were many deep channels found in the 
Yangtze River before the construction of Three Gorges Dam, 
especially in the Xiling Gorge (Jing & Liu, 1985), which 
includes the river reach where we collected this species. Our 
on-site habitat survey further confirmed underground rivers 
with several karst caves near the type locality. Of note, one 
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local resident claimed that blind fish are known to exist in a 
nearby cave (about 4 km from type locality). Although these 
underground rivers and caves have been submerged with the 
impoundment of the Three Gorges Dam, it is reasonable to 
infer that this species could come from an exit of these 
underground river systems. However, whether the species 
exited actively or passively remains unknown. The date when 
we collected the specimen (28 February 2019) was during the 
spawning season of most Sinocyclocheilus species; therefore, 
it is possible that the fish actively exited in an attempt to 
explore a new environment (cave-like deep water) to meet 
breeding requirements. However, turbulence due to geological 
activities within original underground rivers could also result in 
possible passive spread. 

In addition, we still know almost nothing about the life 
history of this species. However, the discovery of this single 
specimen should encourage further efforts to survey, study, 
and protect this rare species. The existence of this fish also 
indicates that junction areas within the Chongqing, Hubei, 
Hunan, and  Guizhou provinces likely harbor other 
undiscovered Sinocyclocheilus cave species (Figure 1G); thus 
far, only three cavefish in the genus Triplophysa have been 
reported (Huang et al., 2019). 


NOMENCLATURAL ACTS REGISTRATION 


The electronic version of this article in portable document format will 
represent a published work according to the International Commission on 
Zoological Nomenclature (ICZN), and hence the new names contained in 
the electronic version are effectively published under that Code from the 
electronic edition alone (see Articles 8.5—8.6 of the Code). This published 
work and the nomenclatural acts it contains have been registered in 
ZooBank, the online registration system for the ICZN. The ZooBank LSIDs 
(Life Science Identifiers) can be resolved and the associated information 
can be viewed through any standard web browser by appending the LSID 
to the prefixhttp://zoobank.org/. 

Publication LSID: 

urn:lsid:zoobank.org:pub: AC302B8C-E445-48A0-9086-C90C40279690. 
Nomenclatural act LSID: 

urn:Isid:zoobank.org:act: 7844B164-1AE4-4617-BC23-49745BE21ED4. 
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ZOOLOGICAL RESEARCH 


A new species of the genus Raorchestes (Anura: 
Rhacophoridae) from Yunnan Province, China 


DEAR EDITOR, 


A new bush frog species, Raorchestes cangyuanensis sp. 
nov., from Cangyuan, Yunnan Province, China, is described 
based on morphological and molecular analyses. It differs 
from all known congeners by a combination of the following 
characters: body size small, adult snout-vent length (SVL) 
16.1—20.0 mm in males (n=3); tympanum indistinct; tips of all 
fingers and toes expanded into discs with circummarginal 
grooves; rudimentary webbing between toes; fingers and toes 
with lateral dermal fringes; inner and outer metacarpal 
tubercles present; heels meeting when limbs held at right 
angles to body; crotch with a distinct black patch; discs of 
fingers and toes orange; male with external single subgular 
vocal sac and reddish nuptial pad at the base of first finger. 
The genus Raorchestes Biju, Shouche, Dubois, Dutta, and 
Bossuyt, 2010 ranks in the top two most speciose genera of 
Rhacophoridae. Raorchestes is characterized by adult snout- 
vent lengths between 15.0 mm and 45.0 mm, no vomerine 
teeth, transparent/translucent vocal sac while calling, direct 
development without free-swimming tadpoles and nocturnal 
lifestyles (Biju et al., 2010; Vijayakumar et al. 2016). 
Raorchestes currently contains 62 species, ranging from the 
southern tip of the Indian Peninsula to northeastern India, 
Indo-China, and southwestern China (Frost, 2019). Thirty-two 
new species have been described in the last decade, entirely 
in India (Biju et al., 2010; Biju & Bossuyt, 2009; Padhye et al., 
2013; Seshadri et al., 2012; Vijayakumar et al., 2014; 
Zachariah et al., 2011). Southeast Asia and southern China 
(SEA-SC) contain only four known species which are R. 
gryllus, R. parvulus, R. menglaensis, and R. longchuanensis, 
with no new species records for Raorchestes in SEA-SC in 
recent years. The high density of recently described species in 
other regions suggests that cryptic lineages may exist in more 
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depauperate areas of Raorchestes distribution that have 
received less investigative attention. 

Southwestern China is a global biodiversity hotspot that 
harbor a high diversity of amphibian species (AmphibiaChina, 
2019; Myers et al., 2000), owed largely to its complicated 
topography (altitude ranges from «2 000 m in some valleys to 
7 558 m a.s.l. at the summit of Gongga Mountain) and variety 
of habitats and  climates. The  amphibian fauna in 
southwestern China is rich in terms of species count and 
endemism (AmphibiaChina, 2019; Frost, 2019). In recent 
years, several cryptic and new species of amphibians have 
been described (Chen et al., 2017, 2018; Li et al., 2018; Lyu 
et al., 2019; Wang et al., 2019; Yang et al., 2016; Yang & 
Chan, 2018; Yu et al., 2019; Yuan et al., 2018). These results 
suggest that the rich amphibian diversity in the region still 
remains underestimated. During fieldwork in Cangyuan, 
Yunnan Province, southwestern China, we collected 
specimens that superficially resembled R. menglaensis, R. 
longchuanensis, and R. parvulus, which potentially occur in 
this region. We evaluated these individuals using molecular 
and morphological phylogenetic analyses. Based on an 
integrative taxonomic approach, we identified a distinct 
evolutionary lineage and describe it as a new species of the 
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genus Raorchestes. 

A total of three specimens were collected at night by 
locating calling males (Figure 1A). The frogs were euthanized 
using benzocaine, and liver tissues taken and preserved in 
95% ethanol. The specimens were then fixed with 10% 
formalin for 24 hours and subsequently transferred to 70% 
ethanol for permanent preservation. Voucher specimens were 
deposited in the herpetological collection of the Museum of 
the Kunming Institute of Zoology (KIZ), Chinese Academy of 
Sciences (CAS). 

Total genomic DNA was extracted, and a partial fragment of 
the mitochondrial 16S rRNA gene (16S) was amplified and 
sequenced. DNA extraction, primers, and PCR cycle protocols 
are shown in the Supplementary Methods. To study the 
historical relationships among  Raorchestes species, 
matrilineal genealogies (phylogenetic trees) were 
reconstructed based on the 16S fragment. Homologous 
sequences of Raorchestes and representative outgroups 
(Kurixalus idiootocus, K. eiffingeri, Polypedates cruciger, and 
Rhacophorus malabaricus) were downloaded from GenBank 
(Supplementary Table S1). Phylogenetic relationships were 
inferred using maximum likelihood (ML) and Bayesian 
inference (BI) methods (Supplementary Methods). We also 
calculated pairwise sequence divergence using uncorrected P- 
distances implemented in MEGA v6.0.6 (Tamura et al., 2013). 

The three preserved adult specimens were measured with 
digital calipers to the nearest 0.1 mm. Measurements followed 
Fei et al. (2009) and Poyarkov et al. (2017) (Supplementary 
Methods). Comparative data on the morphology and 
taxonomy of Raorchestes were obtained from previous 
publications (Bossuyt & Dubois, 2001; Fei et al., 2009; Fei et 
al., 2012; Kou, 1990; Kuramoto & Joshy, 2003; Padhye et al., 
2013; Smith, 1924; Vijayakumar et al., 2014; Yang et al., 
1978; Yu et al., 2019). 

The topologies recovered by both ML and BI analyses were 
essentially identical, with relatively robust support for most 
terminal clades (Figure 1B). The monophyly of Raorchestes 
was strongly supported and in agreement with the results of 
Biju et al. (2010) and Vijayakumar et al. (2016). Among other 
members of the genus Raorchestes, the Cangyuan population 
represented a distinct phylogenetic lineage with strong 
support (Bayesian posterior probabilities=1; bootstrap 
support-100; Figure 1B). The genetic distance between the 
Cangyuan population and other described species of the 
genus ranged from 5.496 (with R. gryllus) to 14.196 (with R. 
archeos) (Supplementary Table S2). A 3% P-distance value 
for 16S rRNA is considered a useful indicator for new 
candidate species in frogs (Vieites et al., 2009). Thus, genetic 
divergence between the Cangyuan population and its 
congeners exceeds the proposed threshold for species-level 
differentiation in frogs. Morphologically, the newly identified 
matriline differed from all named species. Thus, we describe 
the new species of the genus Raorchestes below. 


Taxonomic account 


Raorchestes cangyuanensis sp. nov. Wu, Suwannapoom, Xu, 
Murphy et Che (Figure 1; Table 1) 


Holotype: Adult male (KIZ015856) from Cangyuan County, 
Yunnan Province, China (N23.22542°, E99.22509°, 1272 ma. 
s.l.), collected by Da-Hu Zou and Kai Xu on 25 May 2016. 


Paratypes: Two males KIZ015855 and KIZ015857, collected 
by Da-Hu Zou and Kai Xu. Location and date are the same as 
those of the holotype. 


Diagnosis: Raorchestes  cangyuanensis sp. nov. is 
diagnosed as a member of the genus Raorchestes by the 
following morphological characters: small body size; vomerine 
teeth absent; nocturnally active; tips of all fingers and toes 
expanded into discs with circummarginal grooves. The new 
species is distinguished from geographically and molecularly 
relevant congeners by the following combination of 
characters: (1) body size small, adult SVL 16.1—20.0 mm in 
males (n=3); (2) tympanum indistinct; (3) tongue pyriform, with 
a deep notch at the posterior tip; (4) tips of all fingers and toes 
expanded into discs with circummarginal grooves; (5) no 
webbing between fingers; (6) rudimentary webbing between 
toes; (7) fingers and toes with lateral dermal fringes; (8) inner 
and outer metacarpal tubercles present; (9) inner metatarsal 
tubercle oval, outer metatarsal tubercle absent; (10) heels 
meeting when limbs held at right angles to body; 
(11) tibiotarsal articulation reaching anterior of eye when 
hindlimb is stretched along the side of the body; (12) dark 
brown interorbital triangle between eyes; (13) crotch with a 
distinct black patch; (14) discs of fingers and toes orange; 
(15) dorsal surface brown with a dark ") ("-shaped marking; 
(16) supratympanic fold distinct, from posterior corner of eye 
to above insertion of arm; (17) iris golden brown; (18) male 
with external single subgular vocal sac; and (19) reddish 
nuptial pad at the base of first finger. 


Description of holotype (all measurements in mm; see Table 
1): KIZ015856, adult male (Figure 1). Body size small (SVL= 
20.0); head large, width of head larger than head length 
(maximum head width (HDW) 77.2; head length (HDL) =6.6); 
top of head relatively flat; snout rounded in profile, projecting 
beyond lower jaw; snout length almost equal to diameter of 
eye (eye diameter (ED) =2.6; snout length (SNT) =2.4); 
canthus rostralis rounded, loreal region slightly concave; 
tympanum indistinct; interorbital distance wider than upper 
eyelid width and internasal distance (interorbital distance 
(IOD) =2.4; width of upper eyelid (UEW) =1.6; internarial 
distance (IND) =2.0); interorbital distance between posterior 
margins of eyes 1.9 times that of anterior margins (the 
distance between anterior orbital borders (IFE) =3.5; the 
distance between posterior orbital borders (IBE)=6.8); nostril 
slightly closer to tip of snout than to anterior corner of eyes 
(snout-nostril distance (SN) 71.3; distance from nostril to eye 
(DNE) 21.5); tongue pyriform, with a deep notch at posterior 
tip; vomerine teeth absent; pineal ocellus absent; eyes 
moderately large (eye diameter (ED) =2.6) and protruding, 
pupil horizontal; supratympanic fold distinct, from posterior 
corner of eye to above insertion of arm. 

Forelimbs fairly robust; forelimb length shorter than hand 
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Table 1 Measurements (mm) of Raorchestes cangyuanensis sp. 








nov. 
Catalog No. KIZ015855 KIZ015856* KIZ015857 
Sex Male Male Male 
SVL 19.3 20.0 16.1 
HDL 6.4 6.6 5.5 
HDW 7.2 7.2 6.2 
SNT 2.5 2.4 2.3 
DNE 1.4 1:5 1.1 
IND 1.9 2.0 1.9 
IOD 2.5 24 2.2 
UEW 1.8 1.6 1:7 
ED 2.8 2.6 2.3 
SN 1.1 1.3 0.9 
IFE 3.8 3.5 2.6 
IBE 6.7 6.8 4.7 
FAL 4.2 4.5 3.9 
HL 5.3 5.4 4.9 
THL 9.0 8.9 8.2 
TL 9.0 9.1 TT 
FL 7.8 7.5 6.2 
FLI 1.6 1.8 1.0 
FLII 24 2.1 1.9 
FLIII 3.3 3.7 3.5 
FLIV 24 2.7 2.1 
TLI 1.2 12 1:2 
TLII 1.9 1.7 1.7 
TLIII 3.1 2.9 2.0 
TLIV 3.4 4.2 3.3 
TLV 2.1 2.5 2.0 


For abbreviations, see text and Supplementary Methods. Asterisk (*) 
indicates holotype. 


length (forearm length (FAL) =4.5; hand length (HL) =5.4); 
relative finger lengths: I«II«IV«III (finger length (FL)l=1.8; FLII- 
2.1; FLIIIZ3.7; FLIV=2.7); tips of all four fingers expanded into 
discs with circummarginal grooves; all fingers with lateral 
dermal fringes on both sides; subarticular tubercles distinct, 
rounded, formula: 1, 1, 1, 1; supernumerary tubercles absent; 
no webbing between fingers; inner and outer metacarpal 
tubercles present; male with reddish nuptial pad at the base of 
first finger (Figure 1H). 

Hindlimbs long and relatively robust, thigh length (thigh 
length (THL)=8.9) shorter than tibia length (tibia length (TL)= 
9.1), but greater than foot length (FL=7.5); tibiotarsal 
articulation reaching anterior of eye when hindlimb is 
stretched along the side of the body; heels meeting when 
limbs held at right angles to body; relative toe lengths: I«II«V« 
Ill<IV (toe length (TL) I21.2; TLII21.7, TLIN=2.9; TLIV=4.2; 
TLV=2.5); tips of toes with discs having circummarginal 
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grooves, fewer than discs on fingers; all toes with lateral 
dermal fringes on both sides; subarticular tubercles distinct, 
rounded, formula: 1, 1, 1, 2, 1; supernumerary tubercles 
absent; rudimentary webbing between toes; inner metatarsal 
tubercle rounded, outer metatarsal tubercle absent 
(Figure 1G). 

Dorsal surfaces rough with small granules, flank of body, 
dorsal part of forelimbs, thighs, and tibia relatively smooth and 
scattered with sparse granules; upper eyelid with several 
small granules; throat, chest, and ventral surfaces of forelimbs 
smooth; abdomen, underside of thigh, and around vent with 
granules; dorsolateral folds absent; surfaces of hands and 
feet with numerous granules (Figure 1). 


Color of holotype in life: For color of holotype in life see 
Figure 1. Dorsal surface brown, with golden brown band 
between eyes; dorsal surface with a dark") ("-shaped marking; 
dark brown interorbital triangle between eyes; upper and 
lower lips with white and black dots; supratympanic fold dark 
brown; iris golden brown; dorsal parts of limbs with dark brown 
crossbars; crotch with a distinct black patch bordering large 
creamy white plaque below the black patch near the groin; 
dorsal thigh orange with two black crossbars; ventral surface 
body and limbs brown, with small black and white spots; discs 
of fingers and toes orange (Figure 1). 


Color of holotype in preservative: Dorsum faded to dark 
brown; golden brown band between eyes still clear; a dark ") 
("-shaped marking faintly present on dorsum; black patch 
present at crotch still distinct; large creamy white plaque 
below the black patch still clear; cross bands present on 
dorsal side of forelimbs and hind limb still clear; discs of 
fingers and toes fades to brown; throat, chest, abdomen and 
ventral surface of limbs dark brown, mottled with white dots 
(Supplementary Figure S1). 


Male secondary sexual characteristics: Adult males 
possess nuptial pads covering the dorsal surface of the base 
of Fl; external single subgular vocal sac; and slit-like opening 
at posterior of jaw. 


Variation: Morphometric measurements of holotype and two 
paratypes are given in Table 1. Paratypes generally agree with 
the holotype morphologically, with the following exceptions: 
smaller body length for one of the three adult male specimens 
(KIZ015857); body length of KIZ015857, KIZ015855, and 
KIZ015856 16.1 mm, 19.3 mm, and 20.0 mm, respectively. 
KIZ015855 and KIZ015856 have light brown band between 
eyes, which is absent in paratype KIZ015857. 


Distribution and habitat: Raorchestes cangyuanensis sp. 
nov. is known only from a single locality, Cangyuan County, 
Yunnan, China (N23.22542^, E99.22509° ). The new species 
was found at an elevation of 1 272 m a.s.l. in shrubbery near 
streams. 


Comparisons: Based on morphology, we compared 
Raorchestes cangyuanensis sp. nov. with morphologically, 
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Figure 1 Distribution, Bayesian inference tree and holotype of Raorchestes cangyuanensis sp. nov. 

A: Known distribution of R. cangyuanensis sp. nov. from Cangyuan, Yunnan Province, China. Star shows type locality of the new species. 
B: Phylogram of Raorchestes resulting from analyses of a fragment of the mitochondrial 16S rRNA gene. Nodal support values with Bayesian 
posterior probabilities (BPP)>95/bootstrap support (BS)>70 are shown near node. A “—” denotes bootstrap support<70. Node values with Bayesian 
posterior probabilities (BPP)«95 and bootstrap support (BS)«70 are not shown. Lateral (C), dorsal (D), ventral (E), crotch (F), ventral view of foot 
(G) and hand (H) of male specimen (KIZ015856) of Raorchestes cangyuanensis sp. nov. Photo by Da-Hu Zou. 


geographically, and molecularly similar species. 

Raorchestes cangyuanensis sp. nov. differs from R. 
longchuanensis in the following characters: tympanum 
indistinct in males (vs. distinct); lateral dermal fringes on all 
fingers and toes (vs. only on first and second fingers with 
lateral dermal fringes, lateral dermal fringes of toes absent); 
rudimentary webbing between toes (vs. 1/4 webbing); iris 
golden brown (vs. reddish brown). Raorchestes 
cangyuanensis sp. nov. differs from R. menglaensis by the 
following combination of characters: male with external single 
subgular vocal sac (vs. internal single subgular vocal sac); all 
fingers and toes with lateral dermal fringes (vs. absent); outer 
metatarsal tubercle absent (vs. present); discs of fingers and 
toes orange (vs. not orange). Raorchestes cangyuanensis sp. 
nov. differs from R. gryllus by the following combination of 
characters: SVL of adult male 16.1—20.0 mm (vs. 25.0 
—27.0 mm); rudimentary webbing between toes (vs. little more 
than half webbed); outer metatarsal tubercle absent (vs. 
present). Raorchestes cangyuanensis sp. nov. differs from R. 
parvulus in the following characters: tympanum indistinct (vs. 
distinct); toes with lateral dermal fringes (vs. lateral dermal 
fringes of fifth toe indistinct); SVL of adult male 16.1—20.0 mm 
(vs. 20.1-23.2 mm); supernumerary tubercles absent (vs. 
present on third finger); relative toe lengths: I<ll<V<lll<IV (vs. 
relative toe lengths: I<II<III<V<IV). Raorchestes 
cangyuanensis sp. nov. differs from R. ghatei in the following 
characters: reddish nuptial pad at the base of first finger (vs. 
absent); relative finger lengths: I«II«IV«III (vs. relative finger 


lengths: I<IV<ll<lll); inner and outer metatarsal tubercles 
present (vs. single metatarsal tubercle present); all toes with 
lateral dermal fringes on both sides (vs. absent); dorsal parts 
of limbs with dark brown crossbars (vs. without crossbars). 
Raorchestes cangyuanensis sp. nov. differs from R. 
tuberohumerus in the following characters: inner and outer 
metatarsal tubercles present (vs. inner metatarsal tubercle 
moderate, no outer metatarsal tubercle); relative finger 
lengths: I«II«IV«IlI (vs. relative finger lengths: I<IV<II<III); 
crotch with a distinct black patch bordering large creamy white 
plaque below the black patch near the groin (vs. large yellow 
or reddish yellow markings); discs of fingers and toes orange 
(vs. grey to brown). Raorchestes cangyuanensis sp. nov. 
differs from R. leucolatus in the following characters: crotch 
with a distinct black patch bordering large creamy white 
plaque below the black patch near the groin (vs. groin region 
with white blotches); loreal region slightly concave (vs. loreal 
region flat); relative toe lengths: I«II«V«III«IV (vs. relative toe 
lengths: I«II«IHII«V«IV). 


NOMENCLATURAL ACTS REGISTRATION 


The electronic version of this article in portable document format represents 
a published work according to the International Commission on Zoological 
Nomenclature (ICZN), and hence the new names contained in the 
electronic version are effectively published under that Code from the 
Code). This 
published work and the nomenclatural acts it contains have been registered 


electronic edition alone (see Articles 8.5 - 8.6 of the 
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in ZooBank, the online registration system for the ICZN. The ZooBank 
LSIDs (Life Science Identifiers) can be resolved and the associated 
information can be viewed through any standard web browser by appending 
the LSID to the prefixhttp://zoobank.org/. 

Publication LSID: 
urn:Isid:zoobank.org:pub:97B046A9-A5AC-4EF8-AA43-4AABA31 DAADY. 
Raorchestes cangyuanensis LSID: 

urn:Isid:zoobank.org:act:7 35566BF-4677-479B-B5D0-E52D 1468C229. 
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ZOOLOGICAL RESEARCH 


A new species of the genus Xenophrys (Anura: 
Megophryidae) from northern Thailand 


DEAR EDITOR, 


Species of Xenophrys are conserved morphologically and live 
primarily in forests. In Thailand, the genus harbors many 
cryptic species. Herein we report the collection of specimens 
from Doi Inthanon, Chiang Mai Province, northern Thailand, 
which were identified previously as X. minor. Molecular and 
morphological analyses find that these specimens differ 
significantly from other known congeners, and therefore we 
describe a new species. Further, our phylogenetic analyses 
indicate that X. /atidactyla is a junior synonym of X. 
palpebralespinosa. 

The Indo-Burma and Sundaland biodiversity hotspots span 
Thailand and host a high diversity of amphibian species 
(Myers et al., 2000). In Thailand, at least 32 of 193 species of 
amphibians appear to be endemic (Frost, 2019). Some recent 
studies of Megophrys sensu lato, Leptobrachella and 
Fejervarya have shown that the diversity of amphibians in 
Thailand is underestimated (Chen et al., 2017, 2018; 
Suwannapoom et al., 2016). Further, 10 new species of 
amphibians have been discovered in the past three years 
alone (Frost, 2019; Pawangkhanant et al., 2018; Poyarkov et 
al., 2018; Suwannapoom et al., 2018). Thus, the extensively 
rich amphibian diversity in Thailand appears to remain 
underestimated. Like many other regions, high rates of 
deforestation owe to increased development (Royal Forest 
Department, 2006) and this drives a high risk of extinction 
even before the discovery of new species. 

Genus Xenophrys Gunther, 1864 (family Megophryidae) 
occurs in the southern and eastern Himalayas, Indochina, and 
northward to the Qinling and Huangshan mountains of 
mainland China (Chen et al., 2017). Of the 66 species, 
Thailand has seven species only: X. aceras, X. lekaguli, X. 
longipes, X. major, X. minor, X. parva and X. takensis (Chan- 
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ard, 2003; Mahony, 2011; Nutphund, 2001; Stuart et al., 
2006). Xenophrys minor Stejneger was described originally 
from  Kwanghsien (now Guan Xian, Dujiangyan City), 
55 kilometers northwest of Chengtu (Chengdu), Szechwan 
(Sichuan), China. Several researchers recorded it for montane 
areas of northern Thailand (Chan-ard, 2003; Chuaynkern & 
Chuaynkern, 2012; Nabhitabhata et al., 2000; Nabhitabhata 
and Chan-ard, 2005). Several recent publications revised the 
X. minor complex (Li et al., 2014; Mahony et al., 2013; Wang 
et al., 2012). Further, the type locality occurs in Sichuan, 
China and a distance of over 1 500 km separates them from 
the known localities in northern Thailand. Thus, the taxonomic 
status of Thai populations previously referred to as X. minor 
requires further investigations. 

Our recent fieldwork in Thailand resulted in the collection of 
specimens of Xenophrys cf. "minor" from Chiang Mai Province 
(Doi Inthanon). Further phylogenetic analyses of mtDNA 
sequences and morphological examinations showed that this 
species is distantly related to X. minor from China and can be 
distinguished from all known congeners both by molecular 
and morphological characters. Based on an integrative 
taxonomic approach, we describe this population as a new 
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species of the genus Xenophrys. 

Six individuals were collected from Doi Inthanon, Chiang 
Mai Province, Thailand in August 2017 (Figure 1). After 
specimens were euthanized using benzocaine, liver tissue 
was taken and preserved in 95% alcohol. The specimens 
were fixed with 10% formalin for 24 hours and subsequently 
transferred to 70% ethanol. All specimens were deposited in 
the herpetological collection of the Museum of the Kunming 
Institute of Zoology (KIZ), Chinese Academy of Sciences 
(CAS) and University of Phayao (AUP). Six preserved adult 
specimens were examined and measured to the nearest 0.1 
mm using digital calipers. Morphological terminology followed 
Fei et al. (2009) and Poyarkov et al. (2017) (Supplementary 
Methods). 


A | Posten am 
2552 


a. 


Distance (km) 
25 50 1 





Figure 1 Known distribution of Xenophrys angka sp. nov. from 
Doi Inthanon, Chiang Mai Province, Thailand 

A: Map of Thailand; B: Inset map of northern Thailand. Star shows the 
type locality of the new species. Photo by N.A. Poyarkov. 


Whole genomic DNA was extracted, and a partial fragment 
of the mitochondrial 16S rRNA were amplified and 
sequencing. DNA extraction, primers and PCR cycle protocols 
are in Supplementary Methods. Matrilineal genealogies were 
reconstructed to study the phylogenetic relationships among 
Xenophrys based on the partial mitochondrial 16S rRNA gene. 
Homologous sequences of related species of Xenophrys, and 
those of the outgroups Leptobrachella oshanensis, L. 
ventripunctata, Leptobrachium boringii, Megophrys nasuta, 
and M. baluensis, were downloaded from GenBank 
(Supplementary Table S1). Trees were reconstructed using 
maximum likelihood (ML) and Bayesian inference (BI) 
(Supplementary Methods). Apart from tree-based methods, 
we also calculated row pairwise sequence divergence using 
uncorrected  p-distances implemented in MEGA v6.0.6 
(Tamura et al., 2013). 

The results of the ML and BI analyses yielded essentially 
identical topologies with relatively high nodal support values 
for most terminal nodes (Figure 2). The tree resolved 
monophyly of Xenophrys with two major lineages, 
corresponding to the subgenera Xenophrys sensu stricto and 
Panophrys (Figure 2). Xenophrys cf. "minor" from Chiang Mai 





Province assigned to  Panophrys and phylogenetic 
relationships within which remained essentially unresolved. 
The Chiang Mai population of Xenophrys cf. "minor" was a 
strongly supported lineage (BPP=1, BS=100; Figure 2), which 
different notably from X. minor sensu stricto from China 
(Figure 2). Genetic distance between the Chiang Mai 
population and other species of Xenophrys ranged from 4.5% 
(X. rubrimera, subgenus  Panophrys) to 144% (X. 
mangshanensis, subgenus Xenophrys) (Supplementary Table 
S2). The tree nested X. /atidactyla within the radiation of X. 
palpebralespinosa with high support (BPP=1.0, BS=94) 
(Figure 2). The genetic divergence between X. /atidactyla and 
X. palpebralespinosa was 0.996. 

The Chiang Mai population of Xenophrys cf. "minor" differs 
in a number of taxonomically important diagnostic characters 
from other congeners, including X. minor from China. Thus, 
both mtDNA and morphological analyses clearly indicate that 
this population represents a separately evolving lineage and 
an undescribed species, which we describe below. 


Taxonomic account 
Xenophrys angka sp. nov. 
Figures 3-4; Table 1. 


Chresonymy: Xenophrys minor (partim) — Chan-ard, 2003; 
Nabhitabhata and Chan-ard, 2005; Nabhitabhata et al., 2000; 
Chuaynkern and Chuaynkern, 2012. 


Holotype: KIZ040591, an adult female collected from Kiew 
Mae Pan nature trail in Doi Inthanon, Chiang Mai Province, 
Thailand (N18.556187°, E98.482229°; elevation 2 190 m a.s. 
l.) collected by Chatmongkon Suwannapoom, Parinya 
Pawangkhanant and Nikolay A. Poyarkov on 29, August, 2017. 


Paratypes: Two males KIZ040595 and AUP-00077, three 
females KIZ040592, AUP-00076 and AUP-00055; collected at 
the same locality and same collection information as the 
holotype. 


Etymology: The specific epithet "angka" is given as a noun in 
apposition and refers to the name of the highest mountain of 
Thailand, Doi Angka, located in the Doi Inthanon, Chiang Mai 
Province, Thailand, where the new species occurs. 


Diagnosis: Xenophrys angka sp. nov. is a member of genus 
Xenophrys based on the following combination of 
morphological characters: head large, somewhat narrow and 
comparatively non-compressed, angular; tympanum distinct; 
vertical pupil; transverse skin fold at head basis absent; no 
large horny spines on dorsum; and two narrow glandular mid- 
dorsal ridges present, forming X, H or Y-shaped figure (Chen 
et al., 2017). The following combination of characters 
diagnoses the new species: (1) small body size, adult snout- 
vent length (SVL) 31.2-32.1 mm in males (n=2), 37.5-39.2 mm 
in females (n=4); (2) tympanum distinct and circular; 
(3) vomerine ridges indistinct and vomerine teeth absent; 
(4) maxillary teeth present; (5) tongue heart-shaped, not 
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Figure 2 BI tree resulting from 541 bp length fragment of mitochondrial 16S rRNA gene for Xenophrys species and outgroups 
Bayesian posterior probabilities (BPP)>95%/ML inferences (ML-BS)>70% are shown for each node. Hyphen (“-”) denotes a Bayesian posterior 
probabilities<95% and bootstrap support<70%. 
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Figure 3 Holotype Xenophrys angka sp. nov. (KIZ040591, field 
number AUP-00074, female) in life 

A: Dorsal view; B: Ventral view; C: Head, lateral view; D: Volar view of 
the left hand; E: Plantar view of the left foot. Photos by N.A. Poyarkov. 


notched posteriorly; (6) supratympanic fold distinct, extending 
from the posterior corner of eye to shoulder; (7) webbing 
between toes rudimentary; (8) lateral fringes on toes absent; 
(9) tibio-tarsal articulation reaching the area between eye and 
snout tip; (10) nuptial pads present on finger I; (11) subarticular 
tubercles present on the base of fingers |—1l, but absent on 
fingers III — IV; (12) subarticular tubercle present at base of 
toes |, absent on toes II-IV; (13) heels meeting or overlapping 
when tibias positioned at right angle to body axis; (14) inner 
metatarsal tubercle big, outer metatarsal tubercle absent; (15) 
protruding projection posterior to cloaca of males present; (16) 
dorsal surface with a complete dark brown interorbital triangle 
with light blotch in the middle and two distinct thin opposing 
"V" -shaped reddish glandular ridges with ridge on 
dorsum; (17) orange coloration of groin contrasting with 
surrounding regions on males; and (18) inner metacarpal 
tubercle and outer metatarsal tubercles distinct reddish color. 


Description of holotype (measurements in Table 1): 
KIZ040591, sexually mature female, body habitus stocky 
(Figure 3), body size small (SVL=38.2 mm); head large (head 


Figure 4 Type series of Xenophrys angka sp. nov. in life 

A: Holotype KIZ040591, female; B: Paratype AUP-00055, female; C: 
Paratype AUP-00077, male. Photos by N. A. Poyarkov and C. 
Suwannapoom. 


length (HDL)/SVL 33.896, maximum head width (HDW)/SVL 
32.296), slightly longer than wide (HDW/HDL 95.396); 
triangular in dorsal view; top of head flat; snout short (snout 
length (SNT)/HDL 32.6%) and wide (the distance between 
anterior orbital borders (IFE)/)HDW 50.4%), snout obtusely 
pointed in dorsal view (Figure 3A), sharply protruding in 
profile, without rostral appendage, notably projecting beyond 
lower jaw (Figure 3C); loreal region vertical and concave; 
canthus rostralis distinct, sharp; dorsal region of snout 
flattened; eyes large (eye diameter (ED)/HDL 31.8%); slightly 
protuberant in dorsal view and notably protruding in profile 
(Figure 3), eye less than twice as long as maximum 
tympanum diameter (tympanum diameter (TD)/ED 56.196) and 
subequal to snout length (ED/SNT 97.696); eye-tympanum 
distance less than maximum tympanum diameter (tympanum- 
eye distance (TED)/TD 87.0%); tympanum distinct, circular in 
shape, relatively small (TD/HDL 17.896), eye diameter notably 
larger than tympanum diameter (ED/TD 178.396); nostril 
rounded, laterally orientated, nostril closer to anterior corner of 
eye than to tip of snout (distance from nostril to eye (DNE)/ 
snout-nostril distance (SN) 85.796); internarial distance equal 
to width of upper eyelid (internarial distance (IND)/width of 
upper eyelid (UEW) 100.096), slightly larger than interorbital 
distance (IND/interorbital distance (IOD) 103.096); pineal 
ocellus not visible externally (Figure 3A); tongue heart- 
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Table 1 Measurements (in mm) of type series of Xenophrys 
angka sp. nov. 





KIZ KIZ AUP- AUP- KIZ AUP- 
040591* 040592 00076 00055 040595 00077 

Sex F F F F M M 
SVL 38.2 37.5 38.2 39.2 32.1 31.2 
HDL 12.9 12.8 12.7 13.0 11.3 11.5 
HDW 12.3 12.6 12.4 12.6 10.5 11.0 
SNT 42 4.4 4.0 4.1 3.5 3.7 
DNE 1.8 ET, 1.7 1.8 1.4 1.5 
IND 3.4 3.2 3.7 3.7 3.1 3.3 
IOD 3.3 3.2 3.2 3.5 3.0 2.7 
UEW 34 3.1 3.2 3.2 2.5 2.9 
ED 4.1 4.0 4.2 4.1 3.9 3.7 
TD 2.3 1.8 1.8 2.3 1:7 1.8 
TED 2.0 1.4 1.4 2.1 1.5 1.3 
SN 2.1 2.5 24 2.2 2.0 2.1 
IFE 6.2 6.4 6.6 6.6 5.3 5.3 
IBE 10.4 10.1 10.0 9.6 9.0 9.2 
FAL 7.8 8.0 7.4 8.6 6.4 6.4 
HL 9.4 9.7 9.7 9.3 8.7 8.6 
THL 17.6 18.5 18.1 17.6 14.8 14.0 
TL 18.7 18.8 18.7 18.6 16.0 15.5 
FL. 15.7 16.8 17.2 16.5 15.6 14.3 
IMTL 1.6 2.4 1.9 Taf 1.4 14 
FLI 3.6 3.5 3.6 3.8 3.5 3.9 
FLII 4.1 4.1 4.1 4.2 3.8 4.1 
FLII 6.2 6.3 6.4 6.5 6.0 6.5 
FLIV 44 4.2 4.2 44 4.1 4.3 
TLI 2:1 2.3 2.2 22 24 2.3 
TLII 4.2 4.4 5.0 4.6 4.2 3.9 
TLI — 7.0 6.1 6.9 7.0 6.0 5.6 
TLIV 9.8 10.1 10.4 10.4 8.7 8.2 
TLV 5.3 5.5 5.6 5.1 4.7 4.0 


For abbreviations, see text and Supplementary Methods. Asterisk (*) 
indicates holotype. F: Female; M: Male. 


shaped, not notched posteriorly; vomerine ridges indistinct 
and vomerine teeth absent; maxillary teeth present; pupil 
diamond-shaped (Figure 3C), vertical. 

Forelimbs moderately long and robust; forearm not 
enlarged, length shorter than hand length (forearm length 
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(FAL)/hand length (HL) 83.0%); fingers long and narrow, not 
flattened dorsoventrally, lateral fringes on fingers absent, 
relative finger lengths: I«II«IV«III; tips of all fingers rounded, 
slightly dilated relative to digit widths, with circular pads, 
terminal grooves absent; no webbing between fingers; a large 
subarticular tubercle present at base of fingers I-1l, absent on 
fingers Ill — IV; more distal subarticular tubercles absent, 
replaced by low callous dermal ridges; supernumerary 
tubercles absent; inner metacarpal tubercle big, oval, outer 
metacarpal tubercle small, flattened (Figure 3D). 

Hindlimbs long and robust, thigh length shorter than tibia 
length (thigh length (THL)/tibia length (TL) 94.196), but greater 
than foot length (THL/foot length (FL) 112.196); tibio-tarsal 
articulation of straightened limb reaching eye level; heels 
slightly overlapping when tibias positioned at right angles to 
body axis; toes long and slightly dorsoventrally flattened, 
relative toes lengths: I«II«V«III«IV; tips of all toes rounded, 
slightly dilated, terminal grooves absent; notably expanded 
relative to digit widths forming circular pads; terminal grooves 
absent; lateral dermal fringes on absent; rudimentary webbing 
present between all toes; tarsal fold absent; subarticular 
tubercle present at base of toes |, absent on toes Il- IV, 
replaced by indistinct callous dermal ridges; inner metatarsal 
tubercle large, ca. 1.5x longer than wide, oval-shaped; and 
outer metatarsal tubercle absent (Figure 3E). 

Dorsal surface of body and both dorsal and lateral surfaces 
of head weakly granular; several large distinct warts scattered 
on flanks; horn-like tubercle and several smaller tubercles at 
edge of eyelids present; and supratympanic fold distinct, 
glandular, starting immediately at posterior corner of upper 
eyelid (palpebrum) and running posteriorly towards dorsal 
edge of tympanum, where it sharply curves ventrally 
becoming more prominent and swollen and gently continues 
towards axilla. Dorsolateral folds well-developed, glandular, 
almost straight, running from scapular region posteriorly 
towards sacral region becoming less distinct and interrupted 
posteriorly; two opposing "V" -shaped glandular skin folds 
present on dorsum joined by a ca. 10 mm long dorsomedial 
fold in a hourglass-shape (Figure 3A); dorsal surfaces of limbs 
with small tubercles forming distinct transverse skin folds on 
hindlimbs and irregular reticulate folds on forelimbs; ventral 
surfaces of limbs, chest, abdomen and throat smooth; 
pectoral glands prominent, rounded, located closely to axilla 
(Figure 3D); femoral glands small, oval-shaped, positioned on 
posterior surface of thighs closer to groin than to the knee. 


Color of the holotype in life: Coloration of holotype in life 
shown in Figure 3 and Figure 4A. In life, dorsal surface light 
brown with olive green tint, with a complete dark brown 
inverted triangle with light-ochre central blotch present 
between eyes; all small tubercles and glandular ridges on 
dorsal surfaces of head and body reddish-brown, hourglass- 
shaped dorsal glandular skin ridges also reddish brown 
surrounded with darker grey-brown "X"-shaped marking with 
indistinct borders; flanks with irregular dark brown spots and 
reddish mottling getting denser ventrally (Figure 4A); lateral 
surfaces of head light brownish-gray to beige, upper lips with 


3 to 4 dark reddish-brown vertical bars, the one below eye 
largest and most distinct; tympanum entirely dark brown; white 
supratympanic fold with black to dark-brown lower margin; 
loreal region dark-brown; the edge of upper eyelid 
(palpebrum) with 3 distinct reddish palpebral tubercles, the 
medial one largest forming horn-like projection; dorsally 
palpebral tubercles edged with dark brown; dorsal parts of 
limbs reddish-brown with dark brown crossbars and reddish 
mottling; dorsal surface of the fingers with dark brown 
crossbars; ventral surfaces of body and limbs primarily off- 
white to grey, mottled with dense black and white blotches and 
flecks on belly; black and white marbling getting denser 
anteriorly; gular and chest dark grey and mottled with white; 
posterior part of belly off-white with large grey blotches; 
ventral surfaces of hindlimbs with contrasting black and white 
marbling; groin area, anterior surfaces of thighs and posterior 
surfaces of heels bright reddish-orange in life; pectoral and 
femoral glands small, cream-white in color; iris golden-bronze 
with dense black mottling; pupil edged with thin golden line; 
thenar and palmar surface of limbs dark grey; and inner 
metacarpal tubercle, outer metacarpal tubercle, and inner 
metatarsal tubercle bright pink. 


Color of the holotype in preservative: For coloration see 
Supplementary Figure S1. After 2 years in preservative, 
dorsum faded to dark brown; a complete dark brown inverted 
triangle between eyes and crossbars on dorsal limbs and 
fingers still clear; two opposing "V"-shaped skin folds present 
on dorsum becoming less distinct; throat and chest turns dark 
brown; abdomen and ventral surface of limbs dark brown 
mottled with whitish-grey marbling; inner metacarpal tubercle 
and inner metatarsal tubercle become off-white, outer 
metacarpal tubercle brown; dark brown vertical bars present 
upper lips still clear; supratympanic folds dark brown; pectoral 
glands and femoral glands still distinct, cream; ventral side of 
the hands and toes brown and digit tips brown or cream. 


Variation: Morphometric variation of types reported in Table 1. 
All 5 paratypes are very similar in morphology the holotype, 
but also show variation in color in life. Dorsal surface of 
holotype and paratype AUP-00055 light brown, but paratype 
AUP-00077 orange (Figure 4). 


Male secondary sexual characteristics: The new species 
shows slight differences in body size between the sexes: 
females have slightly larger SVL (37.5 — 39.2 mm, mean 
38.3 mm; n=4) than males (31.2-32.1 mm, mean 31.7 mm; n= 
2). All adult males have nuptial pads pinkish-red in life 
covering the dorsal surface of the base of Fl. Male has 
external, single subgular vocal sac with slit-like openings at 
posterior corners of jaws and a prominent protruding 
projection posterior to cloaca (Figure 4). 


Distribution and ecology: Xenophrys angka sp. nov. is 
currently known only from Doi Inthanon, Chiang Mai province, 
Thailand (Figure 1) at elevations from 1800 to 2200 m a.s.l. 
along forest cascade streams and waterfalls. Most males were 


located by calls made while sitting on vegetation 1—2 m from 
the stream; females were recorded in forest litter along the 
streams. Xenophrys angka sp. nov. appears to be a strict 
forest specialist, restricted to patches of undisturbed montane 
evergreen forests and is likely endemic to the Doi Inthanon — 
Thanon Thong Chai Range. It inhabits forest floor, leaf litter 
and the nearby undergrowth rocky mountainous surrounded 
by moist evergreen broadleaved forests. The new species 
was found in sympatry with Leptobrachium huashen Fei et Ye 
and Limnonectes taylori Matsui, Panha, Khonsue et Kuraishi. 


Comparisons: We compared Xenophrys angka sp. nov. with 
their 66 known congeners on the basis of morphology. 
Comparisons with each subgenus are discussed separately 
below. 

Subgenus Panophrys: Xenophrys angka sp. nov. differs 
from the following large-sized species by having a smaller 
adult male size, SVL 31.2-32.1 mm (vs. adult-male 42.0—45.0 
mm, n=5, in X. baolongensis, Ye et al., 2007; 45.0—51.0 mm, 
n=3, in X. binlingensis, Fei et al., 2009; 81.3 mm, n=1, in X. 
caudoprocta, Fei et al., 2009; 53.0 — 56.5 mm, n=2, in X. 
jingdongensis, Fei et al., 2009; 56.0—59.5 mm, n=10, in X. 
omeimontis, Fei et al, 2009; 54.7 mm, n=1, in X. 
sangzhiensis, Jiang et al., 2008; 99.8— 115.6 mm, n=6, in X. 
shuichengensis, Tian et al., 2000; and 47.2—54.4, n=18, in X. 
spinata, Fei et al., 2009); from X. liboensis by having a smaller 
adult female body size, SVL 37.5-39.2 mm (vs. 60.8-70.6 mm, 
n=8, in X. liboensis, Zhang et al., 2017); further from X. acuta 
by meeting or overlapping when tibias positioned at right 
angle to body axis (vs. not meeting in X. acuta, Li et al., 
2014), and head slightly longer than wide (vs. head length 
slightly shorter than head width in X. acuta, Li et al., 2014); 
from X. binchuanensis by lateral dermal fringes on toes 
absent (vs. lateral dermal fringes on toes wide in X. 
binchuanensis, Fei et al., 2009), and horn-like tubercle above 
eyelids present (vs. absent in X. binchuanensis, Fei et al., 
2009); from X. boettgeri by tongue not notched behind (vs. 
tongue notched behind in X. boettgeri, Fei et al., 2009), lateral 
dermal fringes on absent (vs. present in X. boettgeri, Fei et 
al., 2009), and male with external single subgular vocal sac 
(vs. male with internal single subgular vocal sac in X. 
boettgeri, Fei et al., 2009); from X. brachykolos by head 
slightly longer than wide (vs. head width larger than head 
length in X. brachykolos, Fei et al., 2009), male with external 
single subgular vocal sac (vs. male with internal single 
subgular vocal sac in X. brachykolos, Fei et al., 2009), and 
heels meeting or overlapping (vs. not meeting in X. 
brachykolos, Fei et al., 2009); from X. cheni by lateral fringes 
on fingers and toes absent (vs. lateral fringes on figures and 
toes wide in X. cheni, Wang et al., 2014), a large subarticular 
tubercle present at base of fingers I—Il, absent on fingers III—IV 
(vs. subarticular tubercles indistinct in X. cheni, Wang et al., 
2014), golden-bronze (vs. iris dark brown in X. cheni, Wang et 
al., 2014), and tongue not notched behind (vs. margin of 
tongue notched behind in X. cheni, Wang et al., 2014); from X. 
daweimontis by vomerine ridges indistinct and vomerine teeth 
absent (vs. present in X. daweimontis, Rao & Yang, 1997), 
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and male with external single subgular vocal sac (vs. male 
with internal vocal sac in X. daweimontis, Rao & Yang, 1997); 
from X. dongguanensis and X. nankunensis by vomerine 
ridges indistinct and vomerine teeth absent (vs. strong 
vomerine ridge bearing vomerine teeth in X. dongguanensis 
and X. nankunensis, Wang et al., 2019), subarticular tubercle 
present at base of toes I, absent on toe II-IV (vs. subarticular 
tubercles only present at base of each toe in X. 
dongguanensis and X. nankunensis, Wang et al., 2019), and 
heels meeting or overlapping (vs. heels not meeting in X. 
dongguanensis and X. nankunensis, Wang et al., 2019); from 
X. fansipanensis and X. hoanglienensis by vomerine ridges 
indistinct and vomerine teeth absent (vs. present in X. 
fansipanensis and X. hoanglienensis, Tapley et al., 2018), 
inner and outer metacarpal tubercles present (vs. absent in X. 
fansipanensis and X. hoanglienensis, Tapley et al., 2018), a 
large subarticular tubercle present at base of fingers |- Il, 
absent on fingers III-IV (vs. absent in X. fansipanensis and X. 
hoanglienensis, Tapley et al., 2018), inner metatarsal tubercle 
large (vs. inner metatarsal tubercle very weakly in X. 
fansipanensis and X. hoanglienensis, Tapley et al., 2018), and 
protruding projection posterior to cloaca of males present (vs. 
absent in X. fansipanensis and X. hoanglienensis, Tapley et 
al., 2018); from X. huangshanensis by tympanum distinct (vs. 
indistinct in X. huangshanensis, Fei et al., 2009), male with 
external single subgular vocal sac (vs. male with internal 
single subgular vocal sac in X. huangshanensis, Fei et al., 
2009), subarticular tubercle present at base of toes |, absent 
on toe II —IV (vs. present in X. huangshanensis, Fei et al., 
2009), and heels meeting or overlapping (vs. heels not 
meeting in X. huangshanensis, Fei et al., 2009); from X. 
insularis by vomerine ridges indistinct and vomerine teeth 
absent (vs. vomerine ridge strong with vomerine teeth in X. 
insularis, Wang et al., 2017), subarticular tubercle present at 
base of toes |, absent on toe II—IV (vs. subarticular tubercle 
only present at base of each toe in X. insularis, Wang et al., 
2017), heels meeting or overlapping (vs. not meeting in X. 
insularis, Wang et al., 2017), and tibio-tarsal articulation 
reaching area between eye and snout tip (vs. reaching 
forward to posterior edge of tympanum in X. insularis, Wang 
et al., 2017); from X. jinggangensis by subarticular tubercle 
present at base of toes |, absent on toe II —IV (vs. a large 
subarticular tubercle at base of each toe in X. jinggangensis, 
Wang et al., 2012), relative finger lengths: I«II«IV«III (vs. II«I« 
IV<lll in X. jinggangensis, Wang et al., 2012), and lateral 
fringes on fingers absent (vs. present in X. jinggangensis, 
Wang et al., 2012); from X. jiulianensis by tongue not notched 
posteriorly (vs. tongue weakly notched posteriorly in X. 
jiulianensis, Wang et al., 2019), subarticular tubercle present 
at base of toes |, absent on toe I| — IV (vs. subarticular 
tubercles only present at base of toe | and Il in X. jiulianensis, 
Wang et al., 2019); from X. kuatunensis by male with external 
single subgular vocal sac (vs. male with internal single 
subgular vocal sac in X. kuatunensis, Fei et al., 2009), and 
heels meeting or overlapping (vs. not meeting in X. 
kuatunensis, Fei et al., 2009); from X. leishanensis by male 
with external single subgular vocal sac (vs. male with internal 


570  www.zoores.ac.cn 


single subgular vocal sac in X. leishanensis, Li et al., 2018), 
relative finger lengths: lI«II«IV«Ill (vs. ll«I«V«lll in X. 
leishanensis, Li et al., 2018), and head slightly longer than 
wide (vs. head width slightly larger than head length in X. 
leishanensis, Li et al., 2018); from X. lini by subarticular 
tubercle present at base of toes |, absent on toe II—IV (vs. 
subarticular tubercle distinct at base of each toe in X. lini, 
Wang et al., 2014), and lateral dermal fringes on fingers and 
toes absent (vs. wide in X. lini, Wang et al., 2014); from X. 
lishuiensis by male with external single subgular vocal sac (vs. 
male with internal single subgular vocal sac in X. lishuiensis, 
Wang et al., 2017); from X. minor by a large subarticular 
tubercle present at base of fingers I—Il, absent on fingers III-IV 
(vs. absent in X. minor, Fei et al., 2009), subarticular tubercle 
present at base of toes I, absent on toe II-IV (vs. absent in X. 
minor, Fei et al., 2009), lateral dermal fringes on toes absent 
(vs. absent in X. minor, Fei et al., 2009), male with external 
single subgular vocal sac (vs. male with internal single 
subgular vocal sac in X. minor, Fei et al., 2009), and horn-like 
tubercle above eyelids present (vs. absent in X. minor, Fei et 
al., 2009); from X. mufumontana by relative finger lengths: I«II 
«IV«III (vs. IIzIV«I«III in X. mufumontana, Wang et al., 2019), 
subarticular tubercle present at base of toes I, absent on 
toe II—IV (vs. subarticular tubercles only present at base of 
each toe in X. mufumontana, Wang et al., 2019), a large 
subarticular tubercle present at base of fingers I-1l, absent on 
fingers III-IV (vs. presence of a subarticular tubercle at base 
of each finger in X. mufumontana, Wang et al., 2019), and 
outer metacarpal tubercle small, flattened (vs. indistinct in X. 
mufumontana, Wang et al., 2019); from X. nanlingensis by 
vomerine ridges indistinct and vomerine teeth absent (vs. 
vomerine ridge and vomerine teeth present in X. nanlingensis, 
Wang et al., 2019), tongue not notched posteriorly (vs. tongue 
notched posteriorly in X. nanlingensis, Wang et al., 2019), a 
large subarticular tubercle present at base of fingers | Il, 
absent on fingers Ill — IV (vs. presence of a subarticular 
tubercle at base of each finger in X. nanlingensis, Wang et al., 
2019), subarticular tubercle present at base of toes I, absent 
on toe II-IV (vs. presence of a subarticular tubercle at base of 
each toe in X. nanlingensis, Wang et al., 2019), and adult 
males have nuptial pads pinkish-red in life covering dorsal 
surface of base of Fl (vs. nuptial pads and nuptial spines 
invisible in males during breeding season in X. nanlingensis, 
Wang et al, 2019) from X. obesa by vomerine ridges 
indistinct (vs. two vomerine ridges moderately developed in X. 
obesa, Li et al., 2014), heels meeting or overlapping (vs. not 
meeting in X. obesa, Li et al., 2014), and head slightly longer 
than wide (vs. head width slightly larger than head length in X. 
obesa, Li et al., 2014); from X. ombrophila by heels meeting 
or overlapping (vs. not meeting in X. ombrophila, Messenger 
et al, 2019), and tibio-tarsal articulation reaching area 
between eye and snout tip (vs. reaching posterior corner of 
eye in X. ombrophila, Messenger et al., 2019); from X. 
palpebralespinosa by vomerine teeth absent (vs. present in X. 
palpebralespinosa, Fei et al., 2009), lateral dermal fringes on 
fingers and toes absent (vs. slight lateral fringes on fingers, 
lateral fringes wide on toes in X. palpebralespinosa, Fei et al., 


2009), and male with external single subgular vocal sac (vs. 
male with internal single subgular vocal sac in X. 
palpebralespinosa, Fei et al., 2009); from X. rubrimera by 
vomerine teeth absent (vs. present in X. rubrimera, Tapley et 
al., 2017), a large subarticular tubercle present at base of 
fingers | — Il, absent on fingers lll — IV (vs. absence of 
subarticular tubercles on fingers in X. rubrimera, Tapley et al., 
2017), and head slightly longer than wide (vs. head width 
slightly larger than head length in X. rubrimera, Tapley et al., 
2017); from X. tuberogranulata by male with external single 
subgular vocal sac (vs. male with internal single subgular 
vocal sac in X. tuberogranulata, Mo et al., 2010), a large 
subarticular tubercle present at base of fingers I-II, absent on 
fingers III —IV (vs. indistinct in X. tuberogranulata, Mo et al., 
2010), and relative finger lengths: I«II«IV«III (vs. II<IV=I<III in 
X. tuberogranulata, Mo et al., 2010); from X. wugongensis by 
heels meeting or overlapping (vs. not meeting in X. 
wugongensis, Wang et al., 2019), a large subarticular tubercle 
present at base of fingers I—II, absent on fingers III—IV (vs. 
presence of a subarticular tubercle at base of each finger in X. 
wugongensis, Wang et al, 2019), subarticular tubercle 
present at base of toes |, absent on toe II—IV (vs. presence of 
a subarticular tubercle at base of each toe in X. wugongensis, 
Wang et al., 2019); from X. wuliangshanensis by a large 
subarticular tubercle present at base of fingers I-1l, absent on 
fingers III-IV (vs. indistinct in X. wuliangshanensis, Fei et al., 
2009), male with external single subgular vocal sac (vs. male 
with internal single subgular vocal sac in X. wuliangshanensis, 
Fei et al., 2009), horn-like tubercle at edge of eyelids present 
(vs. absent in X. wuliangshanensis, Fei et al., 2009), and 
relative finger lengths:  I<II<IV<IIl (vs. IV<IlII<I=Il in 
wuliangshanensis, Fei et al., 2009); from X. wushanensis by 
horn-like tubercle and some smaller tubercles at edge of 
eyelids present (vs. absent in X. wushanensis, Fei et al., 
2009), male with external single subgular vocal sac (vs. male 
with internal single subgular vocal sac in X. wushanensis, Fei 
et al., 2009), and lateral dermal fringes on toes absent (vs. 
male with wide lateral fringes in X. wushanensis, Fei et al., 
2009). 

Subgenus Xenophrys: Xenophrys angka sp. nov. differs 
from the following large-sized specie by having a smaller adult 
male size, SVL 31.2—32.1 mm (vs. adult-male 55.8-62.4 mm, 
n=6, in X. aceras, Wang et al., 2017; 39.1—45.0 mm, n=8, in 
X. ancrae, Mahony et al, 2013; 76.7 mm, n-20, in X. 
auralensis, Ohler., 2002; 57.1 mm, n=1, in X. damrei, Mahony, 
2011; 56.9-68.4 mm, n=4, in X. flavipunctata, Mahony et al., 
2018; 76.3-81.0 mm, n=10, in X. glandulosa, Fei et al., 2009; 
68.0—73.5 mm, n=7, in X. himalayana, Mahony et al., 2018; 
55.6—-66.6 mm, n=8, in X. lekaguli, Stuart et al., 2006; 47.0 mm, 
n=1, in X. longipes, Wang et al., 2017; 65.5 mm, n=1, in X. 
major, Fei et al., 2009; 62.5 mm, n=1, in X. mangshanensis, 
Fei et al., 2009; 58.0—76.0 mm, n=6, in X. maosonensis, Yang 
et al., 2018; 57.2-68.0 mm, n=16, in X. medogensis, Fei et al., 
2009; 71.3 — 93.8 mm, n=12, in X. periosa, Mahony et al., 
2018; 73.5-83.1 mm, n=6, in X. robusta, Mahony et al., 2018; 
45.9—53.4 mm, n=7, in X. megacephala, Mahony et al., 2011; 
47.4—53.0 mm, n=33, in X. takensis, Mahony, 2011); from X. 


oreocrypta by having a smaller adult female body size, SVL 
37.5—39.2 mm (vs. 94.9 mm, n=1, in X. oreocrypta, Mahony et 
al, 2018); further from X. monticola by vomerine ridges 
indistinct (vs. distinct in X. monticola, Mahony et al., 2018), a 
large subarticular tubercle present at base of fingers | Il, 
absent on fingers III-IV (vs. absent in X. monticola, Mahony et 
al., 2018), inner and outer metacarpal tubercles distinct (vs. 
absent in X. monticola, Mahony et al., 2018), and inner 
metatarsal tubercle large (vs. indistinct in X. monticola, 
Mahony et al., 2018); from X. oropedion by vomerine ridges 
indistinct and vomerine teeth absent (vs. present in X. 
oropedion, Mahony et al., 2013), a large subarticular tubercle 
present at base of fingers I—Il, absent on fingers III—IV (vs. 
absent in X. oropedion, Mahony et al., 2013), protruding 
projection posterior to cloaca of males present (vs. absent in 
X. oropedion, Mahony et al., 2013), and inner and outer 
metacarpal tubercles distinct (vs. absent in X. oropedion, 
Mahony et al., 2013); from X. pachyproctus and X. parva by 
vomerine ridges indistinct and vomerine teeth absent (vs. 
present in X. pachyproctus and X. parva, Fei et al., 2009), a 
large subarticular tubercle present at base of fingers |- ll, 
absent on fingers III-IV (vs. absent in X. pachyproctus and X. 
parva, Fei et al., 2009), and male with external single subgular 
vocal sac (vs. male with internal single subgular vocal sac in 
X. pachyproctus and X. parva, Fei et al., 2009); from X. 
serchhipii by toes with rudimentary webbing (vs. at least one- 
fourth webbing in X. serchhipii, Mathew & Sen, 2007), relative 
finger lengths: I«II«IV«III (vs. 2nd and 4th fingers subequal in 
length in X. serchhipii, Mathew & Sen, 2007), and vomerine 
teeth absent (vs. present in X. serchhipii, Mathew & Sen, 
2007); from X. vegrandis by a large subarticular tubercle 
present at base of fingers I—1Il, absent on fingers III—IV (vs. 
absent in X. serchhipii, Mathew & Sen, 2007), protruding 
projection posterior to cloaca of males present (vs. absent in 
X. serchhipii, Mathew & Sen, 2007), and pupil vertical, iris 
golden-bronze (vs. pupil horizontally orientated, iris metallic 
yellowish-orange in X. serchhipii, Mathew & Sen, 2007); from 
X. zhangi by vomerine ridges indistinct and vomerine teeth 
absent (vs. present in X. zhangi, Fei et al., 2009), a large 
subarticular tubercle present at base of fingers I-1l, absent on 
fingers III-IV (vs. absent in X. zhangi, Fei et al., 2009), lateral 
dermal fringes on toes absent (vs. present in X. zhangi, Fei et 
al., 2009), and male with external single subgular vocal sac 
(vs. male with internal single subgular vocal sac in X. zhangi, 
Fei et al., 2009); from X. zunhebotoensis by horn-like tubercle 
and a number of smaller tubercles at edge of eyelids present 
(vs. absent in X. zunhebotoensis, Mathew & Sen, 2007), 
tongue not notched behind (vs. tongue notched behind in X. 
zunhebotoensis, Mathew & Sen, 2007), and vomerine teeth 
absent (vs. present in X. zunhebotoensis, Mathew & Sen, 
2007). 

From species not yet assigned to a subgenus: Xenophrys 
angka sp. nov. differs from X. feii by lateral dermal fringes on 
toes absent (vs. lateral fringes on toes moderate to wide in X. 
feii, Yang et al., 2018), lateral fringes on fingers absent (vs. 
moderate lateral fringes present on outer most three fingers in 
X. feii, Yang et al., 2018), groin coloration in life contrasting 
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with surrounding regions on males (vs. groin coloration not 
contrasting with surrounding regions on males in X. feii, Yang 
et al., 2018), and nuptial pads present on finger | (vs. absent 
in X. feii, Yang et al., 2018). 

The diversity of Xenophrys is underestimated greatly (Chen 
et al., 2017; Liu et al., 2018). In the last three years, 19 new 
species were described mostly from China, Vietnam and India 
(Li et al., 2018; Mahony et al., 2018; Messenger et al., 2019; 
Tapley et al., 2017, 2018Wang et al., 2017; Wang et al., 2017; 
Wang et al., 2019; Yang et al., 2018; Zhang et al., 2017). 
Compared to other countries in the Indochinese region, the 
diversity of Xenophrys in Thailand attracted less attention and 
no new species were described for the country within the last 
seven years. Herein, we describe a new species of Xenophrys 
from northern Thailand based on morphological and molecular 
analyses. Our discovery increases the number of amphibian 
species recorded in Thailand to 194, and the species number 
of Xenophrys to 27. The diversity of amphibians known from 
Thailand has increased remarkably from 125 (Khonsue and 
Thirakhupt, 2001) to 193 (Frost, 2019) and the diversity 
remains underestimated (Chan et al., 2018; Chen et al., 2017, 
2018; Grismer et al., 2016; Laopichienpong et al., 2016; 
Matsui et al., 2018; Pawangkhanant et al., 2018; Poyarkov et 
al 2018; Sheridan & Stuart, 2018; Suwannapoom et al., 
2017; Suwannapoom et al., 2018). Further field surveys and 
taxonomic studies on the Thai herpetofauna will likely result in 
further discoveries of yet unknown lineages and species of 
amphibians. 

Our study also elucidates taxonomic status of Xenophrys 
latidactyla (Orlov et al., 2015), which a recently described from 
northern Vietnam. It was described as a member of 
Megophrys sensu lato based on a single male specimen 
collected from Pu Mat National Park in south-western Nghe 
An Province. Orlov et al. (2015) indicated that this species is 
most closely resembles X. palpebralespinosa, inhabiting 
montane areas of northern Vietnam (recorded from provinces 
Lao Cai, He Giang, Cao Bang, Vinh Phuc, Son La, Thanh 
Hoa, Nge An) and northern Laos, eastern part of Yunnan and 
the westernmost part of Guangxi, China (Bourret, 1937, 1942; 
Fei et al., 2009, 20100rlov et al., 2015; Tapley et al., 2017). 
The main morphological differences of these two species were 
the presence of very wide dermal fringes along all length of 
the toes in X. latidactyla (vs. broad dermal fringes absent in X. 
palpebralespinosa), presence of prominent  subarticular 
tubercles in X. latidactyla (vs. absent), and differences in 
tympanum size. Our mtDNA genealogy unambiguously places 
two specimens of X. /atidactyla, including the type specimen 
ZISP 12182, within radiation of X. palpebralespinosa from 
northern Vietnam and Laos. The genetic divergence between 
X. latidactyla and X. palpebralespinosa is small, ranging from 
0.396—1.496, suggesting that these two taxa might be 
conspecific. Morphological differences that were used by 
Orlov et al. (2015) to diagnose these species might be subject 
to significant variation, which was not assessed due to small 
sample size (n=1): feet webbing and dermal fringes on toes in 
X. palpebralespinosa are normally significantly reduced out of 
the breeding season and get enlarged during the reproduction 


572 www.zoores.ac.cn 


(personal observation); the reported distinct subarticular 
tubercles in X. latidactyla might be the result of partial 
dehydration of limbs due to preservation in ethanol. Thus, no 
unambiguous morphological or molecular characters 
distinguishing these two species. Therefore, we transfer X. 
latidactyla to the synonymy of X. palpebralespinosa as a 
subjective junior synonym. Further, this result indicates that 
molecular analyses are essential in assessing species 
diversity of Xenophrys, and megophryids in general. 
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ZOOLOGICAL RESEARCH 


First record of the ferret-badger Melogale 
cucphuongensis Nadler et al., 2011 (Carnivora: 
Mustelidae), with description of a new subspecies, in 


southeastern China 


DEAR EDITOR, 


The ferret-badger Melogale cucphuongensis was first 
described from Cuc Phuong National Park in Vietnam, with no 
subsequent reports in later years. During our surveys of the 
Wuyishan Mountains of Fujian Province in southeastern China 
during May 2018, a Melogale specimen was identified. 
Analysis based on pelage and skull characteristics as well as 
molecular data indicated it to be a new subspecies, which we 
nominated as Melogale cucphuongensis guadunensis subsp. 
nov. This is the first record of the species in China. Therefore, 
this research not only expands the distribution range of the 
species beyond Vietnam, but also indicates that geographic 
variation of the species should have been proceeding in 
southeastern China. 

Ferret-badgers (Melogale |. Geoffroy, 1831) were originally 
assigned into four species distributed in Indochina, Java, Bali, 
China, and northeastern Borneo (Corbet & Hill, 1992; Wilson 
& Reeder, 2005). Among them, Melogale everetti Thomas, 
1895 is restricted to Mount Kina Balu in northern Borneo, M. 
orientalis Horsfield, 1821 occurs in Indonesia, M. personata 
Geoffroy, 1831 is distributed from the southern Indo-China 
Peninsula northward to Assam, including southwestern China, 
and M. moschata Gray, 1831 is found in China (from 
southwestern to eastern areas, including Hainan and Taiwan), 
India, Laos, and Vietnam (Corbet & Hill, 1992; Smith & Xie, 
2009; Wang, 2003; Wilson & Reeder, 2005). Currently, there 
are only two Melogale species, M. personata and M. 
moschata, distributed in China. Based on skull morphology, 
pelage characteristics, and mitochondrial cyt b gene analyses, 


Nadler et al. (2011) described a new species (M. 
cucphuongensis) from Vietnam, which occurs sympatrically 
with M. moschata and M. personata. However, no subsequent 
reports on these or new species have occurred in recent years. 

We conducted a mammal survey in the Wuyishan 
Mountains of southeastern China in May 2018. We discovered 
one male Melogale individual (collection number: 2018000079) 
with a crushed head, which had died on a road near Guadun 
village (E177.64°, N27.73°, Altitude: 1074 m a.s.l.). 

We described the pelage and dentition characteristics 
(despite the damaged skull) and carried out morphological 
measurements. In total, four cranial measurements were 
taken with a digital caliper to an accuracy of 0.01 mm, 
including: upper tooth row (UTR), lower tooth row (LTR), P4: 
length of P4, P1-M1: length from front edge of P1 to end of 
M1. Body weight (BW) and four external measurements were 
also taken, including: body length (BL), tail length (TL), hind 
foot length (HF), and ear length (EL). 

Total genomic DNA was extracted from muscle tissue 
collected from the specimen. The tissue sample was digested 
using proteinase K, and subsequently purified following 
standard phenol/chloroform isolation and ethanol precipitation. 
A fragment encoding the complete cyt b gene was amplified 
using primer pairs L14724: 5-CGAAGCTTGATATGAAAAACC 
ATCGTTG-3', and H15915: 5-AACTGCAGTCATCTCCGGTT 
TACAAGAC-3' (Irwin et al., 1991). Polymerase chain reaction 
(PCR) amplification was performed in 50 pL reactions under 
the following cycling conditions: initial denaturing step at 95 *C 
for 4 min; 35 cycles of denaturing at 94 °C for 60 s, annealing 
at 51 °C for 60 s, extension at 72 °C for 60 s; and final 
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extension at 72 °C for 10 min. Sequencing was conducted 
using the corresponding PCR primers, with the new sequence 
deposited in GenBank under Accession No. MK167045. 
Available homologous sequences of other species were 
obtained from GenBank. Taxidea taxus was selected as the 
outgroup. 

Sequences were aligned using MUSCLE with default 
parameters in MEGA 7 (Kumar et al., 2016). Uncorrected 
pairwise distances were calculated in MEGA 7. The best 
substitution model was selected using the Akaike Information 
Criterion (AIC) in MODELTEST v3.7 (Posada & Crandall, 
1998). Bayesian inferences were performed in MRBAYES 
3.1.2 (Huelsenbeck & Ronquist, 2001) based on the selected 
substitution models (GTR+I+G). Two runs were performed 
simultaneously with four Markov chains starting from a 
random tree. The chains were run for 3 000 000 generations 
and sampled every 100 generations. The first 2596 of sampled 
trees were discarded as burn-in when the standard deviation 
of split frequencies of the two runs was less than 0.01. The 
remaining trees were then used to create a consensus tree 
and estimate Bayesian posterior probabilities (BPPs). 


Taxonomy 
Melogale cucphuongensis guadunensis Li et al., subsp. nov. 


Holotype: KIZ 2018000079, 5, adult, collected on 13 May 
2018, from Guadun (E177.64°, N27.73^, Altitude: 1074 m a.s. 
I), Wuyishan, Fujian, China. The holotype is preserved at the 
Kunming Natural History Museum of Zoology, Kunming 
Institute of Zoology, Chinese Academy of Sciences, Yunnan, 
China. 


Etymology: The new subspecies is named according to the 
type locality. 


Diagnosis: Cheeks and rear of eye yellow-brown; edge of ear- 
conch yellow-brown; small whitish patch between eyes and on 
ears respective, and on the neck back also; tail pale brown all 
around with whitish hairs tips, transforming to yellow-brown at 
end. P^ ~1/3 length of P'-M' (length from front edge of P' to 
end of M’). 


Description: Head brown-black, deeper in color than back; 
naked nose terminal elongated and wedge-like, mouth 
positioned clearly behind nose. Vibrissae dark brown and 
long, anteorbital area black-brown. Small whitish patch 
between eyes and at vertex between ears. Frontal part brown. 
Cheeks and rear of eye yellow-brown; edge of ear-conch 
yellow-brown (Figure 1). 





Figure 1 Holotype of Melogale cucphuongensis guadunensis Li et al., subsp. nov. 


General color of back brown with long frosted-like, whitish 
hair tips; whitish patch on center of back of neck, with no 
whitish stripe from center of occiput backward to neck and 
shoulders. Under part of body yellow-cream. Chin and throat 
thinner than abdomen (Figure 1). 

Feet covered in brown hairs overall, but naked on soles; 
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forefeet claws stronger than those of hindfeet and slightly 
curved. Tail bushy, pale brown all around with whitish hair tips, 
transforming to yellow-brown at end (Figure 1). 

As noted above, although the skull was damaged, most 
teeth were preserved. Dental formula: 13/3, C1/1, P4/4, M1/2= 
38. C! developed, height twice that of P; P! smallest cheek 


teeth; P? nearly same as Pš, but less than latter more or less; 
P^ well developed, mesostyle markedly larger than paracone, 
metacone elongated backward significantly, forming sharp 
dental ridge, outer edge slightly concaved, parastyle lingually 
more developed than protocone; M! developed, lingual edge 
extended to form oval, clear concavity between paracone and 
metacone at labial edge; Position of |, obviously backward; C, 
developed, with curving sharp rear rim; P, much smaller than 
P, and P4; P, less complicated than P^ with developed center 
cusp; M, with developed anterior trigon and three similar size 


cusps on crown, second half without clear cusps, but with 
sharp edge; M, contract conspicuous, round, with outer and 
inner cusps (Figure 1). 

Melogale moschata ferreogrisea 


Synonyms: (Hilzheimer, 


1905). 


Measurements: Holotype measurements (mm) of Melogale 
cucphuongensis guadunensis Li et al., subsp. nov. are listed 
in Table 1. 





Table 1 Measurements (mm) of Melogale cucphuongensis guadunensis Li et al., subsp. nov. holotype 
Character BW BL TL HF El: UTR LTR Pt P'-M' 
Measurement 800 350 180 55 30 31.54 30.03 6.61 20.86 


Body weight (BW) (g), Body length (BL), Tail length (TL), Hind foot length (HF), Ear length (EL), Upper tooth row (UTR), Lower tooth row (LTR), P^: 


length of P^, P'-M': length from front edge of P! to end of M’. 


Molecular genetics: Based on a 1 140 bp fragment of cyt 
b and using Taxidea taxus as the outgroup (all obtained from 
GenBank except for holotype in the study), Bayesian 
phylogram of M. personata, M. moschata, M. 
cucphuongensis (JN032659 which is only 423 bp of cyt b) 
and Arctonyx eight genera of Mustelidae those who distribute 





Taxidea taxus AF057132 


1.00 


in China are displayed. Each group is listed in the tree 
followed by its GenBank Accession No. As shown in Figure 
2, M. c. guadunensis subsp. nov. and M. c. cucphuongensis 
formed a sister group, which, combined with the sister group 
of M. personata and M. moschata, formed the Melogale 
clade. 
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Figure 2 Bayesian phylogram inferred from 1 140 bp fragment of cyt b 


Numbers above branches are Bayesian posterior probabilities. 


Based on the uncorrected pairwise distances, the genetic 
difference between M. c. guadunensis subsp. nov. and M. 
moschata was 7.46-7.63, between M. moschata and M. 
personata was 4.12—4.82, between M. c. guadunensis subsp. 


Mustela sibirica HM106317 








1.00 








L—— Mustela eversmannii AB026102 


nov. and M. personata was 8.42, between M. c. 
cucphuongensis and M. personata was 8.27, and between M. 
c. guadunensis subsp. nov. and M. c. cucphuongensis was 
1.42 (Table 2). 


Zoological Research 40(6): 575-579, 2019 577 


Table 2 Uncorrected pairwise distances estimated from a 1 140 bp fragment of cyt b 


Sample 1 2 3 4 5 6 7 8 

1 

2 1.42 

3 8.42 8.27 

4 7.46 7.57 4.21 

5 7.63 7.57 482 2.89 

6 7.54 7.33 412 0.26 2.81 

7 15.70 14.42 16.84 16.23 16.58 16.14 

8 15.35 13.48 14.39 14.39 14.39 14.30 17.98 

9 14.82 14.89 15.00 15.35 15.00 15.26 16.75 11.32 
10 14.30 13.48 14.12 13.77 13.95 13.86 17.37 7.37 
11 13.51 13.00 14.21 13.60 14.21 13.51 14.56 16.49 
12 14.30 13.48 14.56 14.56 14.21 14.47 17.46 15.09 
13 16.32 14.89 16.75 16.05 15.88 15.96 825 17.02 
14 13.86 12.77 14.21 13.25 14.30 13.16 15.88 14.91 
15 12.63 11.58 14.56 13.77 14.04 13.68 16.23 14.74 
16 13.86 12.29 14.04 13.77 13.77 13.68 15.61 16.05 
17 13.95 13.48 14.91 14.39 14.47 14.30 16.40 14.39 
18 15.35 13.00 15.18 15.35 15.26 15.26 15.96 14.82 
19 14.39 13.95 15.53 15.44 15.53 15.35 16.14 15.96 


1: Melogale cucphuongensis guadunensis; 2: M. c. cucphuongensis; 3: 


9 10 11 12 13 14 15 16 17 18 
10.18 

15.00 15.35 

13.25 13.60 10.96 

16.40 16.58 16.75 17.28 

13.77 14.56 12.81 13.25 16.49 

13.68 14.39 14.65 13.33 15.35 7.81 

15.35 15.18 13.60 14.74 15.88 7.37 7.98 

13.86 14.56 14.91 14.47 15.79 8.42 3.77 8.07 

14.74 15.18 14.91 15.96 15.70 13.68 12.63 11.75 13.42 
14.47 14.65 14.39 14.30 16.32 14.65 14.04 14.91 14.39 15.88 


M. personata; 4: M. moschata AF498158; 5: M. moschata KP726273; 6: M. 


moschata 020644; 7: Arctonyx collaris; 8: Aonyx cinerea; 9: Lutra lutra; 10: Lutrogale perspicillata; 11: Martes foina; 12: M. flavigula; 13: Meles 
meles; 14: Mustela altaica; 15: M. eversmannii; 16: M. nivalis; 17: M. sibirica; 18: M. strigidorsa; 19: Taxidea taxus. 


In contrast to the description of the characteristics of M. c. 
cucphuongensis (Nadler et al., 2011), we list a key to the 
subspecies below: 


Key to Melogale cucphuongensis subspecies 
1) Cheeks yellow-brown; tail pale brown all around with 
whitish hairs tips, transforming to yellow-brown at end; 
P4~1/3 length of P'-M' --- M. c. guadunensis subsp. nov. 
Cheeks brown, with no white markings; tail uniform in 
color; P*~1/4 length of P'-M' M. c. cucphuongensis 
Melogale c. guadunensis subsp. nov. is similar to M. 
moschata in a number of characteristics, including brown- 
black head (deeper in color than back), naked wedge-like 
nose terminal, and small whitish patch between eyes and 
ears, respectively (Figure 1). As M. moschata occurs 
throughout southern China (Corbet & Hill, 1992; Smith & Xie, 
2009; Wang, 2003; Wilson & Reeder, 2005), many previously 
discovered Melogale samples from China, especially from 
central and southeastern areas, have long been regarded as 
M. moschata. However, M. c. guadunensis subsp. nov. can 
be clearly distinguished from M. moschata by the following 
characteristics: whitish stripe absent on center of occiput 
backward to neck (vs. present) (Figure 1); rhinarium wedge- 
like (vs. subcircular) (Figure 1); P^ ~1/3 length of P'-M' (vs. ~1/ 
4) (Table 1); tail length over 50% of body length (Table 1) (vs. 
less than 50%) (Gao, 1987; Zheng & Xu, 1983). 
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The genetic differences between M. c. guadunensis subsp. 
nov. and M. moschata and M. c. guadunensis subsp. nov. 
and M. personata are greater than that between M. moschata 
and M. personata. Furthermore, the genetic difference 
between M. c. guadunensis subsp. nov. and M. c. 
cucphuongensis is only 1.42 (Table 2). These results not only 
confirm that M. c. guadunensis subsp. nov. is markedly 
different from M. moschata and M. personata, but also 
demonstrate that M. c. guadunensis subsp. nov. and M. c. 
cucphuongensis are the same species. Phylogenetic analysis 
also indicated that M. c. guadunensis subsp. nov. and M. c. 
cucphuongensis formed the M. cucphuongensis clade, and M. 
moschata and M. personata were phylogenetically closer than 
either to M. cucphuongensis (Figure 2). 

Melogale c. cucphuongensis, which is known from Cuc 
Phuong National Park in Vietnam, is found in limestone 
formations with primary, slightly degraded primary, and 
secondary forests (Nadler et al, 2011). Melogale c. 
guadunensis subsp. nov. is distributed in Guadun, Wuyishan, 
Fujian, China, within subtropical evergreen broadleaf forest 
and coniferous and broad-leaved mixed forest. The two type 
localities are separated by the vast expanse of southern 
China, which raises the question: how does the species 
disperse across such an expansive area? Currently, very 
limited information is available in regard to its biology and 
ecology. Thus, additional studies on its distribution, population 
status, and biological habits are required. 
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ZOOLOGICAL RESEARCH 


Ongoing green peafowl protection in China 


DEAR EDITOR, 


The green peafowl (Pavo muticus) is one of the most 
threatened pheasants in the world. In China, it is widely 
appreciated for its beauty as well as historical and cultural 
value, but current populations number less than 500 
individuals. Recently, Tang and colleagues reported in 
Science that the green peafowl is likely to become extinct due 
to the construction of the Jiasajiang Level 1 Hydropower 
Station within the Red River Upstream District (RRUD) and 
thus called for a stop to this project (Tang et al., 2019). 
According to our recent surveys, however, this species is still 
extant in 22 counties of Yunnan Province, China, among 
which, only two within the RRUD have been predicted to be 
affected by floods from the hydropower station. Therefore, the 
conclusion that the species will likely go extinct in China upon 
completion of the dam is unwarranted. In fact, construction of 
the Jiasajiang Level 1 Hydropower Station was stopped in 
August 2017. The main challenge for green peafowl 
conservation is that over 65% of the population occurs outside 
of protected reserves in China. Fortunately, the Chinese 
government has adopted an Ecological Redline (ERL) 
strategy to achieve ecological civilization plans, thus bringing 
new hope to the conservation of green peafowls both inside 
and outside of protected reserves. As a top conservation 
priority for China, the government is fully committed to 
conserving this peafowl. 

The green peafowl is one of the most threatened pheasants 
in the world and is currently listed as Endangered on the 
International Union for the Conservation of Nature (IUCN) Red 
List of Threatened Species (BirdLife International, 2016). 
Despites its aesthetic, cultural, and historical value in China, 
recent comprehensive surveys indicate that the Chinese 
green peafowl population currently numbers less than 500 
individuals (Kong et al., 2018). 

Earlier this year, Tang et al. (2019) reported that the green 
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peafowl is likely to become extinct due to the construction of 
the Jiasajiang Level 1 Hydropower Station within the RRUD, 
and therefore called for the project to be ceased. They also 
proposed that additional conservation efforts should be 
considered for this endangered species, such as an 
immediate ecological impact re-evaluation, expansion of 
nature reserves, and increasing number of patrols. As the 
main green peafowl research team in China (Ornithology 
Group of the Kunming Institute of Zoology, Chinese Academy 
of Sciences), we have conducted several green peafowl 
surveys, including two comprehensive surveys in the 1990s 
(Wen et al., 1995) and 2014 — 2018 (Kong et al., 2018), to 
clarify the status and trends of this species in China. In 
addition, for the past five years, we have specifically focused 
on the population dynamics of the green peafowl in the entire 
RRUD as well as its behavioral ecology and impacts of human 
disturbance in Konglonghe Reserve within the RRUD. 
Following the assertions of Tang et al. (2019), we would like to 
clarify several points. 

Tang et al. (2019) stated that the RRUD is China's last 
green peafowl habitat. While it is undoubtedly true that the 
green peafowl has experienced a sharp range contraction 
(76096) in the past three decades (Kong et al., 2018; Wen et 
al., 1995), our recently published surveys (e.g., Kong et al., 
2018), as cited in Tang et al. (2019), found this species to be 
extant in 22 counties of Yunnan Province, China. Among 
these 22 counties, just six fall within the RRUD (Kong et al., 
2018) (Figure 1) and only two (Xinping and Shuangbai) are 
predicted to be affected by floods emanating from the 
hydropower station construction (CPCGKSDIC, 2014). 
Therefore, the conclusion that the species will likely go extinct 
in China upon completion of the dam is unwarranted. 

Nevertheless, given that large infrastructure projects are 
potentially damaging to biodiversity (Johnson et al., 2017), we 
agree with Tang and colleagues that work on the Jiasajiang 
Level 1 Hydropower Station should stop until assessments 
are complete. Indeed, due to concerns raised by conservation 
stakeholders in China on the plight of the green peafowl within 
the RRUD, construction of the Jiasajiang Level 1 Hydropower 
Station was stopped in August 2017. At the same time, other 
projects under construction within the area, including a silver 
mine, second hydropower station, and proposed expressway, 
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Figure 1 Distribution sites and population size of green peafowl in Yunnan, China 


have ceased or been cancelled. 

In addition to infrastructure projects, habitat conversion, 
poisoning, and poaching are also major threats affecting the 
survival of the green peafowl in China (Kong et al., 2018). Our 
surveys in 2018 showed that over 65% of the green peafowl 
population occurs outside protected reserves in China, 
making this species even more vulnerable to the 
aforementioned threats. 

However, China's ambitious Ecological Civilization Plan 
(UNEP, 2016; Xiao & Zhao, 2017) brings new hope for the 
conservation of the green peafowl and the other threatened 
species and ecosystems in the country. Since late 2012, 
environmental protection has become the core element in 
Chinese governmental work. In 2017, for the first time, senior 
officials in Gansu Province, Northwest China, were punished 
over environmental violations in the Qilian Mountains National 
Nature Reserve (Xinhua, 2017). To achieve an ecological 
civilization, the Chinese government has adopted an 


Ecological Redline (ERL) strategy. The ERL refers to the 
strictly controlled and legally drawn boundaries for key areas 
of ecological functions or ecological sensitivity (UNEP, 2016). 
These ERL areas are now formally protected by provisions 
within China's newly revised Environmental Protection Law of 
2015 (Bai et al., 2016). Fortunately, the ERL zones in Yunnan 
Province, demarcated and formally announced on 29 June 
2018 (The People's Government of Yunnan Province, 2018), 
cover the entire distribution range of the green peafowl 
population in China. Most green peafowl habitats inside and 
outside protected reserves, including those within the RRUD, 
are now incorporated into the ERL of Yunnan. Therefore, 
according to the Chinese Environmental Protection Law, it will 
be impossible to commence construction of the Jiasajiang 
Level 1 Hydropower Station or other infrastructural projects. 
This will further enhance protection of the green peafowl, 
especially for populations outside of protected reserves within 
the RRUD. 
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In addition to existing efforts geared toward conserving the 
endangered green peafowl in China, there is an ongoing 
program to include additional rangers and camera traps to 
monitor the population status and dynamics as well as current 
and potential threats of this species. Furthermore, as a top 
conservation priority in China, the government is fully 
committed to conserving this peafowl in collaboration with 
research scientists, NGOs, and local communities living in 
close proximity. 


COMPETING INTERESTS 


The authors declare that they have no competing interests. 


AUTHORS' CONTRIBUTIONS 


X.J.Y. and F.W. designed the study. X.J.Y., F.W., D.J.K., P.F.S., J.W., and G. 
Y.L. supervised the analyses and wrote the manuscript. F.W., D.J.K., P.F.S., 
J.W., and G.N.K. revised the manuscript. All authors read and approved the 
final version of the manuscript. 


Fei Wu', De-Jun Kong’, Peng-Fei Shan'?, Jie Wang'?, 
Gladys Nyakeru Kungu'?, Guang-Yi Lu*, Xiao-Jun Yang" 


1 State Key Laboratory of Genetic Resources and Evolution, 
Kunming Institute of Zoology, Chinese Academy of Sciences, 
Kunming Yunnan 650223, China 

? College of Agronomy and Life Sciences, Kunming University, 
Kunming Yunnan 650214, China 

3 Kunming College of Life Science, University of Chinese 
Academy of Sciences, Kunming Yunnan 650204, China 

^ Henan University of Urban Construction, Pingdingshan 
Henan 467036, China 

"Corresponding author, E-mail: yangxj@mail.kiz.ac.cn 


582  www.zoores.ac.cn 


REFERENCES 


Bai Y, Jiang B, Wang M, Li H, Alatalo JM, Huang S. 2016. New ecological 
redline policy (ERP) to secure ecosystem services in China. Land Use 
Policy, 55: 348-351. 

CPCGKSDIC (China Power Construction Group Kunming Survey and 
Design Institute Co., Ltd.). 2014. Environmental impact assessment report 
of the Jiasajiang Level 1 Hydropower Station in Yunnan province. www. 
doc88.com/p-0992324394228.html.(in Chinese). 

Johnson CN, Balmford A, Brook BW, Buettel JC, Galetti M, Lei GC, 
Wilmshurst JM. 2017. Biodiversity losses and conservation responses in 
the Anthropocene. Science, 356(6335): 270-275. 

Kong DJ, Wu F, Shan PF, Gao JY, Yan D, Luo WX, Yang XJ. 2018. 
Status and distribution changes of the endangered Green Peafowl (Pavo 
muticus) in China over the past three decades (1990s — 2017). Avian 
Research, 9: 18. 

Tang WW, Wang XX, Yan M, Zeng GM, Liang J. 2019. China's dams 
threaten green peafowl. Science, 364(6444): 943. 

The People’s Government of Yunnan Province. 2018. The Red Line of 
Ecological Protection in Yunnan Province (No. 32 [2018]). http://www.yn.gov. 
cn/zwgk/zcwj/zxwj/201806/t20180629 148248. html? 
S pcqq aiomsg. (in Chinese). 

UNEP. 2016. Green is Gold: The Strategy and Actions of China's 
Ecological Civilization. 

Wen XJ, Yang XJ, Han LX, Yang L, Wang WM. 1995. Investigations on the 
current status of the distribution of green peafowl in China. Chinese 
Biodiversity, 3: 46-51. (in Chinese). 

Xiao LG, Zhao RQ. 2017. China's new era of ecological civilization. 
Science, 358(6366): 1008-1009. 


Xinhua. 2017. China punishes officials over environmental violations in 


tdsourcetag= 


Qilian Mountains. http://english. gov. cn/news/top_news/2017/07/21/content_ 
281475743731750.htm. (in Chinese). 


ZOOLOGICAL RESEARCH 


Direct sunlight exposure reduces hair cortisol levels in 
rhesus monkeys (Macaca mulatta) 


DEAR EDITOR, 


Major depressive disorder (MDD), commonly known as 
depression, is a mental disease characterized by a core 
symptom of low mood. It lasts at least two weeks (Badamasi 
et al., 2019; Wang et al., 2019) and is frequently accompanied 
by low self-esteem, loss of interest in routinely enjoyable 
activities, low energy, and unexplained pain (Huey et al., 
2018; Park et al., 2012; Post & Warden, 2018; Rice et al., 
2019; Xiao et al., 2018). Approximately 296—896 of adults with 
MDD commit suicide (Richards & O'Hara, 2014; Strakowski & 
Nelson, 2015), and around half of suicidal individuals suffer 
depression or other mood disorders (Bachmann, 2018). 
According to the criteria of the American Psychiatric 
Association's Diagnostic & Statistical Manual, Fourth Edition 
(DSM-IV), seasonal affective disorder (SAD) is a type of 
depression (Wirz-Justice, 2018) and can occur with major 
depressive episodes (Birtwistle & Martin, 1999). SAD sufferers 
exhibit various associated symptoms, such as feelings of 
hopelessness and worthlessness, suicidal thoughts, loss of 
interest in activities, and withdrawal from social interactions 
(Austen & Wilson, 2001; Eagles, 2004). SAD is typically 
associated with winter depression, and symptoms normally 
improve upon the arrival of summer (Austen & Wilson, 2001). 
Seasonal mood variations are also believed to be associated 
with light. Specifically, SAD is reported to be correlated with 
duration of sunlight exposure (Lam et al., 2006). In spring and 
summer, sunlight exposure is longer and daytime hours 
exceed nighttime hours. In winter, sunlight exposure becomes 
shorter and evenings arrive earlier. Thus, SAD is noticeably 
more frequent at latitudes in the Arctic region, including Alaska 
(N64*00') where the rate of SAD is as high as 9.2% (Booker 
& Hellekson, 1992; Kegel et al., 2009). Cloud coverage may 
contribute to the negative effects of SAD (Modell et al., 2005). 
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There is growing evidence that many SAD patients exhibit 
delays in circadian rhythm and that bright light therapy may 
improve symptoms by correcting these delays (Avery et al., 
2001a; Lam & Levitan, 2000). 

Although bright light is known to be an effective therapy for 
depression (Avery et al, 2001b; Howland, 2009), the 
underlying mechanism has remained elusive. Etiological 
hypotheses have focused on alterations in biological cycles 
due to seasonal light variations. For example, previous 
research on starry flounder showed that prolonged light 
exposure was associated with a reduction in melatonin 
secretion (Bógner et al., 2018). Melatonin plays a vital role in 
the regulation of biological rhythms (Ardura et al., 2003; Emet 
et al., 2016). As a result, the quality of sleep in SAD sufferers 
declines and they are more likely to get nervous and stressed. 
Although light has been shown to relieve depression by 
impacting the circadian rhythms of cortisol secretion (Avery et 
al., 1997; Koller et al., 1994; Rutten et al., 2019), other studies 
have failed to detect any correlation between light and cortisol 
rhythm (Bogner et al., 2018; Schmidt et al., 2018). Thus, these 
conflicting results indicate that other aspects of cortisol 
variation may be involved. 

The cumulative level of cortisol is considered to be an 
important biochemical factor in the pathogenesis and 
treatment of depression (Caparros-Gonzalez et al., 2017; 
Keller et al., 2017). The rate of depression is higher among 
patients with hypercortisolism (Maripuu et al., 2014), and the 
cumulative level of cortisol is abnormal in 4096-6096 of 
depression sufferers (Booij et al., 2015). More importantly, 
most anti-depressants work through their effects on cortisol 
receptors and the alteration in cumulative levels of cortisol 
(Carvalho & Pariante, 2008; Holsboer, 2000; Mason & 
Pariante, 2006). Based on these assumptions, light therapy 
may improve depression by altering the cumulative levels of 
cortisol, with levels possibly correlated with light duration. 
Thus, in the current study, we investigated the correlation 
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between light duration and cumulative levels of hair cortisol in 
rhesus macaques (Macaca mulatta). 

As the final product of the hypothalamic-pituitary-adrenal 
(HPA) axis, cortisol is an important stress hormone and can 
reflect and regulate stress system function. Compared with 
other analytical methods, including serum and saliva cortisol, 
hair cortisol levels are not affected by circadian rhythm, food 
intake, or acute stress (Li et al., 2009; Yamada et al., 2007). 
Furthermore, the cortisol in hair is freely combined and thus 
biologically active and relatively stable. It can reflect 
cumulative cortisol levels in an organism over a 3—5-month 
period and is thus suitable for measuring the long-term 
cumulative levels of cortisol (Davenport et al., 2006; Kalra et 
al., 2007). 

To examine the effects of sunlight exposure duration on 
HPA axis function, hair samples were collected twice in 7-year- 
old monkeys. The position of monkeys was recorded on the 
second day after hair sampling to exclude potential 
interference from sampling and to investigate the duration of 
sunlight exposure. A total of 20 and 22 age-matched adult 
male rhesus macaques were assigned to a short sunlight 
exposure group (n=8, 4, 4, 4 within each cage) and long 
sunlight exposure group (n=7, 4, 4, 4, 3 within each cage), 
respectively. All animals resided within their groups with fixed 
counterparts for more than six months and their living 
environments were deemed stable. This study was approved 
by the Biological Research Ethics Committee of the Kunming 
Institute of Zoology, Chinese Academy of Sciences. All animal 
procedures were conducted in accordance with the National 
Institutes of Health Guide for the Care and Use of Laboratory 
Animals. 

The adult males lived with their cage mates either in small 
(n<4) or large social groups (n=7). They were raised in indoor 
(2.61 mx2.46 mx2.58 m) and outdoor (2.67 mx2.66 mx 
2.67 m) colonies. Each group of rhesus macaques was able 
to enter and exit the cage at will, i.e., they could freely move 
between indoor and outdoor colonies. Monkeys in the outdoor 
colonies were exposed to sunlight during the daytime. 
Commercial monkey biscuits were provided twice daily with 
tap water ad libitum, and fruit and vegetables were offered 
once daily. 

The cages of the short sunlight exposure group were 
oriented eastwards, whereas the cages of the long sunlight 
exposure group were oriented southwards, thus the durations 
of sunlight exposure differed between the two groups. The 
east- and south-oriented cages received direct sunlight 
simultaneously. At sunset, the east-oriented cages were cast 
under a large shadow, whereas the south-oriented cages 
continued to have direct sunlight for an additional hour. As a 
result, east-oriented cage mates represented the short 
sunlight exposure group, whereas the south-oriented cage 
mates represented the long sunlight exposure group. 

The monkeys were approximately seven years old (7+ 
1 years). Hair samples were collected between 1330h and 
1500h. Each animal was captured by an experienced 
technician using a net and subsequently taken out of the 
colony for hair sampling. After manual restraint, the back hair 
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was clipped with scissors and then stored in a small pouch of 
aluminum foil. The hair cortisol extraction procedure was 
based on our previous study (Feng et al., 2011). For removing 
surface contaminants, hair samples were rinsed twice in 5 mL 
of isopropanol (3 min each), dried at 35 °C for 8 h, and then 
pulverized using a Retsch ball mill (Retsch MM400, Germany) 
at 26 Hz for 2.5 min. Approximately 200 mg of powdered hair 
was weighed and incubated at a slow rotation for 24 h in 4 mL 
of methanol at room temperature. After centrifugation at 8 000 
g for 5 min at 4 °C, 2 mL of supernatant was transferred into a 
centrifuge tube and dried under a stream of nitrogen gas. 
Finally, the extract was reconstituted with 0.25 mL of 
phosphate buffer solution (PBS) and assayed for cortisol 
levels using a commercial kit (CORTISOL RIA KIT REF 
IM1841, Beckman, USA). 

Dedicated observers were stationed in front of the outdoor 
cages to record the position of the caged monkeys and to 
calculate the duration of direct sunlight exposure. 
Observations were conducted for 2 h daily, including 1 h in the 
morning and 1 h in the afternoon. If a monkey spent time in 
the outdoor colony, then it was considered to have received 
direct sunlight. In contrast, those in the indoor colony received 
no direct sunlight. To avoid disturbing the animals, observers 
stayed at least 5 m away from the cage during recording. 
Specific measurement procedures were followed. Using 
Monkey-A as an example, when it stayed in an outdoor 
colony, the precise timepoint was recorded (e.g., 1010h). If it 
moved to an indoor colony, the precise timepoint was also 
recorded (e. g., 1015h) The interval between the two 
timepoints was marked (e.g., 5 min) as one episodic event of 
direct sunlight exposure. Multiple episodes of locomotion 
occurred within the 1 h of recording. All recorded durations of 
direct sunlight exposure in the morning were added for total 
morning duration of direct sunlight exposure (e.g., 5+7+3+2+ 
17=34 min). Afternoon durations of direct sunlight exposure 
were collected similarly. The total duration of direct sunlight 
exposure was calculated by summing all durations. 

The long sunlight exposed monkeys in the south-oriented 
cages had significantly lower cumulative levels of cortisol than 
the short sunlight exposed counterparts in the east-oriented 
cages (Mann-Whitney test, n=42, U=123, P=0.014) (Figure 
1A). After removing two outliers, the same pattern was still 
found (unpaired t-test, n=40, t=2.388, P=0.022) (Figure 1B). 

Results showed that the long sunlight exposed monkeys 
had significantly lower cumulative levels of cortisol than their 
short sunlight exposed counterparts. To the best of our 
knowledge, this is the first study demonstrating that the effects 
of sunlight exposure on the cumulative cortisol levels in 
monkeys are stable and reproducible. Furthermore, this study 
has pioneered the use of hair analysis to prove that longer 
sunlight exposure can lower the level of freely combined, 
biologically active cortisol in rhesus monkeys. Thus, stable- 
living rhesus monkeys could serve as an excellent animal 
model for examining sunlight exposure in humans and 
elucidating the mechanism of SAD for more effective 
interventions. Our results agree with previous findings on the 
effects of indoor artificial light on monkeys (Qin et al., 2015). 
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Figure 1 Comparison of cumulative hair cortisol levels in long and short sunlight exposed monkeys 

A: Specific concentrations of hair cortisol in each monkey in short (east-oriented cages, n=20, red squares) and long sunlight exposed monkeys 
(south-oriented cages, n=22, gray triangles). Data are meanstSEM (standard error of the mean). *: P«0.05. B: Specific concentrations of hair 
cortisol in long (n=22) and short sunlight exposed monkeys (n=18) after removing two outliers. 


Sunlight has a relatively wide spectrum, and thus the impact 
of direct natural sunlight exposure is more enduring and 
stable. Although other research has shown the opposite 
results (Tian et al., 2019), we speculate that different subjects 
or species and methods of cortisol extraction may account for 
these inconsistencies. 

One possible reason for the long-lasting hypercortisolism in 
the short sunlight exposed monkeys could be excessive 
activation of the HPA axis induced by short duration of direct 
sunlight exposure, a common stressor for daily living. The 
development of psychiatric disorders such as SAD may 
include an onset of hypercortisolism phase due to a lack of 
sunlight exposure. In contrast, due to the lack of a short 
sunlight exposure stressor, the long sunlight exposed 
monkeys exhibited a lower level of hair cortisol. It could be 
speculated that a lower level of cumulative cortisol may have 
a positive effect on SAD patients. 
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ZOOLOGICAL RESEARCH 


Diversity scaling of human vaginal microbial 


communities 


DEAR EDITOR, 


The composition and diversity of the human vaginal microbial 
community have been investigated intensively due to the 
diversity-stability relationship (DSR) -based hypothesis for 
bacterial vaginosis (BV) etiology, which was first proposed in 
the 1990s and has received renewed interest in recent years. 
Nevertheless, diversity changes (scaling) across individuals in 
a cohort or population have not yet been addressed, which is 
significant both theoretically and practically. Theoretically, 
biodiversity scaling is the core of biogeography, and 
practically, inter-subject heterogeneity is critical for 
understanding the etiology and epidemiology of human 
microbiome-associated diseases such as BV. Here we applied 
the diversity-area relationship (DAR), a recent extension to 
the classic species-area relationship (SAR), to study diversity 
scaling of the vaginal microbiome by reanalyzing reported 
data collected from 1107 postpartum women. The model 
used here characterized the power-law (or its extension) 
relationships between accrued diversity and areas (numbers 
of individuals), upon which four biogeographic profiles were 
thus defined. Specifically, we established the DAR profile 
(relationship between diversity scaling parameter and so- 
termed diversity order (q)), similarly pair-wise diversity overlap 
(PDO) profile, maximal accrual diversity (MAD) profile, and 
ratio of individual-level to population-level diversity (RIP) 
profile. These four profiles offer valuable tools to assess and 
predict diversity scaling (changes) in the human vaginal 
microbiome across individuals, as well as to understand the 
dynamics of vaginal microbiomes in healthy women. 

The human vaginal microbiome is a complex ecosystem 
that plays critical roles in maintaining host health. As the first 
defense of the reproductive tract, the vaginal microbiome is 
critical for the prevention of opportunistic pathogen 
colonization and viral infection. For example, endogenous, 
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healthy vaginal microbiota can help protect against HIV 
infection by activating local and systemic inflammation; 
however, microbiota associated with BV can also increase 
susceptibility to HIV infection (Buvé et al., 2014; Petrova et al., 
2013). For pregnant women, the vaginal microbiota is not only 
associated with maternal health but also that of neonates, with 
the composition of the newly colonized microbiome playing a 
key role in newborn immunity and metabolic development 
(Cox et al., 2014; Dominguez-Bello et al., 2010; Olszak et al., 
2012; Rutayisire et al., 2016). Furthermore, babies delivered 
by cesarean section can have a higher risk of metabolic and 
immune diseases than those delivered vaginally (Dominguez- 
Bello et al., 2010; Sevelsted et al., 2015; Younes et al., 2018), 
although Chu et al. (2017) noted that delivery mode does not 
influence microbiome composition in newborns. Moreover, in 
pregnancy, vaginal dysbiosis is hypothesized to be a 
contributor to spontaneous preterm birth (Freitas et al., 2018; 
Romero et al., 2014a; Stout et al., 2017) and miscarriage 
(Ralph et al., 1999). 

In many healthy women, the vaginal microbiota is 
dominated by Lactobacillus spp. (Macklaim et al., 2013; Ravel 
et al., 2011). Several studies (Brotman et al., 2014; Gajer et 
al., 2012; Ma & Li, 2017; Ravel et al., 2011) have confirmed 
the five major community state types of the vaginal 
microbiome in adult women, as first identified by Ravel et al. 
(2011). Four types are dominated by Lactobacillus spp., 
including L. iners, L. crispatus, L. gasseri, and L. jensenii. 
However, 2096-3096 of asymptomatic, otherwise healthy 
individuals lack lactic acid bacteria in their vaginal 
microbiome, which instead consists of obligate anaerobic 
bacteria (Ravel et al., 2011, 2013). In addition, the frequency 
of microbiome type varies in different ethnic groups, with 
those microbiome not dominated by Lactobacillus spp. more 
commonly found in healthy Hispanic and black women than in 
Asian or white women (Ma et al., 2012; Ravel et al., 2011). 
Furthermore, the composition of the vaginal microbiome is 
dynamic during life and associated with menopause stage 
(Muhleisen & Herbst-Karlovetz, 2016). Recent research 
demonstrated the vaginal microbiome of perimenarcheal 
adolescents to be dominated by Lactobacillus spp., including 
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L. crispatus, L. iners, L. gasseri, and L. jensenii, similar to that 
found in reproductive-age women (Hickey et al., 2015). In 
premenopausal women, the vaginal microbiota is still 
dominated by L. crispatus and L. iners, but Lactobacillus spp. 
are often replaced by Streptococcus and Prevotella in the 
perimenopausal and postmenopausal stages (Brotman et al., 
2014). Shifts in vaginal microbiome have also been observed 
during and after pregnancy. For example, diversity and 
richness of the vaginal microbiome is lower in pregnant 
women than in non-pregnant women (Freitas et al., 2017). 
Furthermore, Romero et al. (2014b) showed that the vaginal 
microbiome of pregnant women contains a higher abundance 
of L. vaginalis, L. crispatus, L. gasseri, and L. jensenii, and a 
lower probability of switching to a Lactobacillus-deficient 
community. In addition, radical changes in Lactobacillus-poor 
vaginal communities have been found at delivery, which can 
persist for up to a year (DiGuilio et al., 2015). 

Despite extensive studies on the human vaginal 
microbiome, what constitutes normal or healthy vaginal 
microbiota remains unresolved. For example, Doyle et al. 
(2018) sampled and sequenced the vaginal microbiome of 
1 107 rural Malawi women after pregnancy, and found that 
75.7% (752/994) of the population were dominated by 
Gardnerella vaginalis rather than by Lactobacillus spp., and 
although L. iners increased with time after delivery, G. 
vaginalis still dominated for an extended period. In Doyle's 
study, both the pregnancy delivery mode and ethnicity also 
appeared to influence the composition of vaginal microbiome, 
though all hosts were healthy. 

Previous research has revealed that the biodiversity of 
vaginal microbial communities varies with health status and 
lifestyle of the host. Nevertheless, existing studies have not 
addressed diversity scaling (changes) across individuals in a 
cohort or population. Theoretically, microbiome diversity 
distribution across individual subjects (i. e., space) is 
traditionally a focus of microbial biogeography. Practically 
speaking, understanding the biogeography of the human 
microbiome can reveal critical information on its 
characteristics in a cohort setting, which can, in turn, 
significantly influence studies on the etiology and 
epidemiology of human microbiome-associated diseases such 
as inflammatory bowel disease, obesity, and BV. To effectively 
assess the spatial scaling of human vaginal microbial 
diversity, we applied the DAR model, which is a recent 
extension of the classic SAR in biogeography and 
conservation biology (Bell et al., 2005; Horner-Devin et al., 
2004; MacArthur & Wilson, 1967; Noguez et al., 2005; Peay et 
al., 2007; Triantis et al., 2012; Várbíró et al., 2017; Whittaker 
& Triantis, 2012). SAR is one of the oldest described 
ecological laws or patterns, whereby species richness 
increases with increasing sampling area, and can be traced 
back to the 19th century (Watson, 1835). It is still considered 
one of the most important principles in conservation biology 
and biogeography. The extensions from SAR to DAR 
introduced a several important advances including: 
(1) Expanding species richness (number of species) to 
general diversity measures in Hill numbers (Chao et al., 2012, 
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2014; Hill, 1973), thus making it possible to not only assess 
the scaling of species richness (numbers), but also scaling of 
general diversity (e. g., change in community evenness or 
dominance). Therefore, the classic SAR is a special case of 
the more general DAR; (2) The DAR, PDO, MAD, and local 
regional/global diversity (LRD/LGD) profiles are effective tools 
for the biogeographic mapping of biodiversity over space (Ma, 
2018a, 2018c, 2019). 

In this study, we applied DAR modeling and associated 
biogeographic profiles to investigate the spatial diversity 
scaling of postpartum vaginal microbial communities across 
individuals by reanalyzing the large vaginal microbiome 
dataset originally reported by Doyle et al. (2018). The spatial 
diversity scaling of the vaginal microbiome revealed 
heterogeneity among individuals, which could provide an 
ecological basis for personalized and precise diagnosis and 
treatment of microbiome-associated diseases, including BV. 
The biogeographic profiles of the vaginal microbiome also 
provide tools for explaining the DSR hypothesis for BV 
etiology from multiple dimensions (Ma & Ellison, 2018, 2019). 

The vaginal microbial dataset (Doyle et al, 2018) 
reanalyzed in this study consisted of 1 158 vaginal microbiome 
samples collected from 1107 rural Malawi women post- 
delivery. Most samples were collected within the first 20 d of 
delivery, though some were sampled 5-583 d post-delivery. 
The V5-V7 hypervariable regions of the 16S rRNA genes 
were amplified and sequenced under the MiSeq Illumina 
platform. After quality control, the sequences were clustered 
into 14 354 operational taxonomic units (OTUs) using QIIME 
2.8.6. Samples with less than 2 000 reads were removed, as 
were OTUs with less than 1 000 reads. After prescreening, 
1076 samples and 466 OTUs remained for DAR analysis. In 
DAR analysis, the number of each OTU read is equivalent to 
the population abundance of a species in macro-ecology, or 
OTU abundance in diversity analysis. More detailed 
information on the dataset can be found in Doyle et al. (2018). 

The Hill numbers (Hill, 1973) were reintroduced to ecology 
by Jost (2007) and Chao et al. (2012, 2014), and possess 
certain critical advantages over traditional diversity indexes. 
The Hill numbers for measuring alpha diversity are as follows: 


s MA) 
p= | ` e (1) 
i=1 


When q=1, the Hill number is undefined, but its limit exists in 
the following form: 


S 
'Dz lim °D = p|- 2; pios(p,)| (2) 


where, D is the diversity in Hill numbers, q (=0, 1, 2,...) is the 
order number of diversity, S is the number of species (or 
OTUs), and p, is the relative abundance of species i. The 
diversity order (q) sets the sensitivity of the Hill numbers to the 
relative frequencies of species abundances. When q=0, °D is 
equal to the number of species or species richness (S). When 
q=1, 1D is the number of typical or common species in the 
community and is equal to the exponential of Shannon 
entropy. When q=2, ?D is more sensitive to species with high 
abundance, and is equal to the inverse of the Simpson index. 


Generally, 1D is the diversity of a community with x-?D equally 
abundant species. 

Beta-diversity can be defined with the  multiplicative 
partitioning of Hill numbers (Chao et al., 2012, 2014; Ellison, 
2010; Gotelli & Chao, 2013; Jost, 2007), as follows: 

"D = °D / °D, (3) 
where, 1D, and °D, are the alpha and gamma diversities in 
terms of Hill numbers, respectively. As 7D, is equivalent to the 
alpha diversity of the meta-community, it has the same 
definition as alpha-diversity (Eqn. (1)). Chao et al. (2012, 
2014) defined a series of Hill numbers corresponding to 
different diversity orders (q) as the diversity profile. In this 
study, the diversity or Hill numbers were computed until the 
third order, g=3. 

According to Ma (2018a), we used the power law (PL) DAR 
model and power-law with exponential cutoff (PLEC) model as 
the DAR models for the human vaginal microbiome. The PL 
model is: 

"D = cA’ (4) 
where, ?D is diversity measured in Hill numbers of the q-th 
order, A is the area (number of individuals), and c & z are the 
PL parameters. 

The PLEC model is: 

QD = cA'exp(dA ), (5) 
where, d is a third parameter that is usually less than zero in 
DAR modeling, and exp(dA) is then the exponential decay 
item that eventually overwhelms the power law behavior when 
Ais sufficiently large. 

To simplify parameter estimation, we transformed non-linear 
Equations (4) and (5) into log-linear regression equations: 

In(D ) = In(c) + zin(A) (6) 
In(D) = In(c) + zin(A)+ dA (7) 

In Eqn. (6), z is the slope of the log-linear transformed PL 
model, which is equivalent to its interpretation in the traditional 
SAR—ratio of diversity accrual rate to area increase rate. 
Parameter c of the PL model can be viewed as the number of 
species equivalent to diversity in the first unit of area to 
accrue. Thus, the accrual order of area unit may influence 
parameter c. To deal with this technical issue, the units 
(individuals/samples) to be accumulated were randomly 
permutated each time the DAR model was built. For each 
dataset, we repeatedly applied DAR modeling 100 times by 
randomly re-ordering all samples in the dataset. For the 
detailed computational procedure, please refer to Ma (2018a). 

Similar to the diversity profile concept of Chao et al. (2012, 
2014), which is a series of Hill numbers corresponding to 
different diversity orders (q), Ma (2018a) and Ma & Li (2018) 
proposed four DAR-based profiles, including the DAR, PDO, 
MAD, and LRD/LGD profiles. These four profiles can be 
quantitatively characterized by parameters from the PL/PLEC 
DAR models and can be used to sketch out biogeography 
maps of the human microbiome or other ecological 
communities. 

The DAR profile was defined as a series of z-values 
(scaling parameter) of the PL-DAR model (Eqns. 4 & 6), i.e., a 
series of z-values corresponding to different diversity orders 
(q) or z-q trends. 


The PDO profile was defined as: 
g = 2-2 (8) 
where, z is the scaling parameter of the PL-DAR model, i.e., 
the PDO profile is a series of g (q) values corresponding to 
different diversity orders (q), computed with Eqn. (8). 
The MAD profile was defined as a series of MAD or "Dmax 
values, corresponding to different diversity orders (q): 
"Dmax = e) exp(=z)= cA, exp(-z) (9) 


max 


where, A,,,, =—z/d is the number of individuals (samples) 
needed to reach the MAD, and c and z are parameters of the 
PLEC-DAR model (Eqns. (5) & (7)). 

The RIP profile was defined as a series of RIP values 
corresponding to different diversity orders (q), as specified by 
the following equation: 

"RIP = “cl "D (10) 
where, c is a parameter of the PL-DAR model and D is the 
diversity in Hill numbers estimated with the PLEC-DAR model 
(Eqns. (5) & (7)). Based on the above RIP definition, a RIP 
profile can be defined for a population (cohort) of any size. In 
practice, using Dmax for 7D is more convenient, i.e.: 

"RIP = “cl "Dmax . (11) 

The RIP parameter assesses the average level of an 
individual to represent a population (or cohort) from which the 
individual is a member. The RIP profile is also known as the 
LRD (local-to-regional diversity or LGD  (local-to-global 
diversity) profile in other ecological systems beyond the 
human microbiome (Ma & Li, 2018; Ma, 2019). 

We built two DAR models for the vaginal microbiome, 
including the PL and PLEC models for alpha-diversity and 
beta-diversity scaling, respectively. The results are listed in 
Tables 1 and 2, including the diversity order (g) of Hill 
numbers, mean model parameters (z, In(c), d, g, D,,,,) and 
their standard errors, and measures (correlation coefficient R 
& P-value) for goodness-of-fitting. N represented the number 
of successful fittings out of 100 p re-samplings, as explained 
previously. Re-sampling was performed to deal with the 
possible influence of the order of diversity accrual (i.e., order 
in which the samples were accrued for building the DAR 
model) on model parameter c. Except for two cases of alpha- 
DAR modeling at diversity order q-3, the fittings to the DAR 
models were successful in all 100 re-samplings. Even in the 
two exceptions, the success rates were 97% and 99%, 
respectively. Therefore, the DAR models were considered 
suitable for vaginal microbiome assessment, as also evident 
by the R (linear correlation coefficient) and associated p 
values, which indicated the goodness-of-fit of the DAR models. 

Based on Table 1, we found the following in regard to alpha- 
DAR scaling: 

(1) As one of the most important parameters from the PL- 
DAR model, the scaling parameter (z) at different diversity 
orders (q) was z(q) 7 (0.807(0), 0.171(1), 0.110(2), 0.095(3)), 
where z(q) represents the DAR profile according to previous 
definition. The DAR profile characterizes the diversity scaling 
across individuals (over space) comprehensively. Results also 
showed that the scaling level differed at different orders. For 
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Table 1 Alpha-DAR models computed with 100 re-samplings for the vaginal microbiome 


Diversity order Power law (PL) 


and statistics z In(c) R P-value g N 


PL with exponential cutoff (PLEC) 
z d In(c) R P-value N A D 





Mean 0.807 3.902 0.998 0.000 0.250 
SE 0.054 0.200 0.001 0.000 0.038 

q-0 100 
Min 0.738 3.474 0.995 0.000 0.160 


Max 0.880 4.356 1.000 0.000 0.332 


0.778 0.0001 4.019 0.999 0.000 
0.062 0.0001 0.313 0.001 0.000 
0.642 —0.0002 3.201 0.995 0.000 
0.948 0.0004 4.683 1.000 0.000 


100 N/A N/A 





Mean 0.171 3.345 0.860 0.000 0.874 
Š SE 0.039 0.257 0.073 0.000 0.031 io 
is Min 0.085 2.669 0.582 0.000 0.789 


Max 0.276 3.951 0.963 0.000 0.940 


0.291 —0.0004 2.861 0.930 0.000 
0.074 0.0002 0.374 0.044 0.000 
0.083 —0.0008 1.820 0.782 0.000 
0.488 0.0001 3.831 0.990 0.000 


100 667 86.8 





Mean 0.110 2.808 0.681 0.000 0.920 
SE 0.0407 0.312 0473 0.000 0.035 

q-2 100 
Min 0.013 1.901 0.155 0.000 0.814 


Max 0.246 3.457 0.933 0.000 0.991 


0.229 —0.0004 2.328 0.829 0.000 
0.080 0.0002 0.411 0.107 0.000 
0.023 —0.0008 1.162 0.443 0.000 
0.448 0.0002 3.225 0.974 0.000 


100 530 34.4 





Mean 0.095 2565 0.589 0.013 0.931 
a SE 0054 0.362 0231 0084 0040 O 
i Min -0.002 1.508 0.013 0.000 0.811 


Max 0.250 3.269 0.932 0.667 1.001 


0.209 —0.0004 2.109 0.770 0.001 
0.088 0.0002 0.447 0.157 0.014 
—0.004 -0.0009 0.890 0.061 0.000 
0.431 0.0003 3.112 0.971 0.138 


99 507 24.5 


z, c & d: The parameters of PL- and/or PLEC-DAR models, in which z of PL-DAR model is the scaling parameter, and c can be viewed as the 
number of species equivalent to diversity in the first unit of area to accrue. g: The pair-wise diversity overlap. R: The correlation coefficient to judge 
the goodness-of-fitting of PL or PLEC models. P-value: The parameter to judge the success or failure of the fitting of PL or PLEC models (Ps0.05 


indicates successful model fitting). N: The number of successful fittings out of 100 re-samplings. A 


: The number of accrued individuals 


max* 


corresponding to the maximal accrual diversity. Dmax: The maximal accrual diversity. SE: Standard error. Min: Minimum. Max: Maximum. N/A: Not 


available. 


example, scaling at diversity order q-0, which is equivalent to 
the classic SAR law, was faster than that at g=1, 2, or 3, as 


0 1 2 3 


Diversity order (q) 


0.75 


0.50) 


0.25 


Scaling parameter (z) or pairwise diversity overlap (g) > 








indicated by the monotonically decreasing z-value (see Figure 
1A for alpha-DAR profile). 


pri 
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Figure 1 DAR profile (z-q) and PDO profile (g-q) of the vaginal microbiome 


A: Alpha-diversity scaling; B: Beta-diversity scaling. 


(2) The PDO profile was g(q)=(0.250(0), 0.874(1), 0.920(2), 
0.931(3)). The PDO profile, which characterizes the overlap or 
similarity between pair-wise individuals, showed the opposite 
trend as the DAR profile, i.e., a monotonically increasing trend 
(see Figure 1A for alpha-PDO profile). 

(3) The MAD profile characterizes the theoretically maximal 
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accumulation of diversity across individuals. Here, regarding 
the MAD profile, the PLEC model failed to produce D,,,, at 
diversity order q=0 because d>0, for which a maximum does 
not exist. For the diversity orders q=1, 2, 3, the PLEC model 
for alpha-diversity successfully generated D,,,,, i.e., D, (q)= 
(86.8(1), 34.4(2), 24.5(3)). 


Table 2 Beta-DAR models computed with 100 re-samplings for the vaginal microbiome 


Diversity order Power law (PL) 


PL with exponential cutoff (PLEC) 














and statistics z In(c) R P-value g N z d In(c) R P-value N A, D, 
Mean 0.805 -0.070 1.000 0.000 0.253 0.770 0.00012 0.070 — 1.000 0.000 
SE 0.010 0.065 0.000 0.000 0.012 0.016 0.00004 0.078 0.000 0.000 

q=0 100 100 N/A N/A 
Min 0.782 -0.217 0.999 0.000 0.228 0.722 -0.00002  -0.131 1.000 0.000 
Max 0.826 0.070 1.000 0.000 0.280 0.812 0.00022 0.313 1.000 0.000 
Mean 0.176 1.591 0.884 0.000 0.870 0.298 -0.00043 1.092 0.952 0.000 
SE 0.031 0.208 0.059 0.000 0.024 0.041 0.00011 0.2219 0.034 0.000 

q-1 100 100 687 15.5 
Min 0.104 1.047 0.686 0.000 0.807 0.215 -0.00076 0449 0.829 0.000 
Max 0.255 2.093 0.974 0.000 0.925 0.435 -0.00022 1.570 0.994 0.000 
Mean 0.146 1.890 0.734 0.000 0.893 0.291 -0.00052 1.295 0.867 0.000 
SE 0.050 0.333 0.178 0.000 0.038 0.076 0.00024 0.368 0.111 0.000 

q-2 100 100 563 173 
Min 0.017 1.026 0.106 0.000 0.791 0.102 -0.00124 0.227 0.335 0.000 
Max 0.274 2.730 0.978 0.001 0.988 0.525 -0.00002 2.356 0.987 0.000 
Mean 0.166 1.963 0.711 0.000 0.877 0.330 -0.00058 1.293 0.850 0.000 
SE 0.062 0421 0.200 0.000 0.049 0.103 0.00035 0.468 0.129 0.000 

q-3 100 100 566 213 
Min 0.021 0.798 | 0.108 0.000 0.735 0.069 -0.00149 0.115 0.203 0.000 
Max 0.339 2917 0.972 0.000 0.986 0.591 0.00016 2663 0.980 0.000 


Z, c & d: The parameters of PL- and/or PLEC-DAR models, in which z of PL-DAR model is the scaling parameter, and c can be viewed as the 
number of species equivalent to diversity in the first unit of area to accrue. g: The pair-wise diversity overlap. R: The correlation coefficient to judge 
the goodness-of-fitting of PL or PLEC models. P-value: The parameter to judge the success or failure of the fitting of PL or PLEC models (Ps0.05 


indicates successful model fitting). N: The number of successful fittings out of 100 re-samplings. A 


The number of accrued individuals 


max* 


corresponding to the maximal accrual diversity. Dmax: The maximal accrual diversity. SE: Standard error. Min: Minimum. Max: Maximum. N/A: Not 


available. 


From Table 2, we found the following in regard to beta-DAR 
scaling: 

(1) As one of the most important parameters of the PL-DAR 
model, the beta-diversity scaling parameter (z) at different 
diversity orders (q) was z(q) = (0.805(0), 0.176(1), 0.146(2), 
0.166(3)), where z(q) represents the DAR profile according to 
previous definition and characterizes diversity scaling across 
individuals (over space) comprehensively (see Figure 1B for 
beta-DAR profile). Comparison between the beta-DAR and 
alpha-DAR profiles revealed an interesting phenomenon: i.e., 
the alpha-DAR profile monotonically decreased with q, 
whereas the beta-DAR profile was valley-shaped. This 
suggests that, at a lower diversity order (q), the alpha-DAR 
and beta-DAR scaling parameters (z) were rather close to 
each other, but the difference was enlarged at higher diversity 
orders (q). 

(2) The beta-PDO profile was g(q) = (0.253(0), 0.870(1), 
0.893(2), 0.877(3)) for beta-diversity scaling. Here, the PDO 
profile, which characterizes the overlap or similarity between 
pair-wise individuals, showed the opposite trend to the DAR 
profile, i.e., a bell-shaped trend (see Figure 1B for beta-PDO 
profile). 

(3) The MAD profile characterizes the theoretical maximal 
accumulation of diversity across individuals. Here, regarding 
the beta-MAD profile, the PLEC model failed to produce Dmax 
at diversity order g=0 because d>0, for which a maximum 


does not exist. For diversity orders q=1, 2, 3, the PLEC model 
for beta-diversity successfully generated D,,, i. €., Dmax(Q) = 
(15.5(1), 17.3(2), 21.3(3)). 

Table 3 shows the RIP values for both alpha-DAR and beta- 
DAR of the vaginal microbiome. At diversity order g=0, the 
estimation of "Das failed, and RIP for g-0 could not be 
estimated. For q=1, 2, 3, RIP was successfully estimated for 
alpha- and  beta-diversity, respectively. Here, RIP 
characterized the relationship between  individual- and 
population-level diversity. For example, at diversity order q-1, 
alpha-RIP=0.327 and beta-RIP=0.316, indicating that an 
average individual represented approximately 33% and 3296 
of population alpha- and beta-diversity, respectively. 

In the current study, we investigated the diversity (including 
alpha- and beta-diversity) scaling of the human vaginal 
microbiome across individuals by re-analyzing a big dataset 


Table 3 Ratio of individual-level to population-level diversity 
(RIP) of the vaginal microbiome 


Diversity order RIP for alpha-DAR RIP for beta-DAR 





q-0 N/A N/A 

q=1 0.327 0.316 
q=2 0.482 0.383 
q=3 0.530 0.335 


N/A: Not available. 
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originally published by Doyle et al. (2018). Compared with the 
microbial SAR range reported in existing literature for other 
microbes, such as Green & Bohannan’s (2006) range 
between 0.019—0.470, the scaling parameter (z) estimated in 
our study, i.e., alpha-z-0.807, beta-z-0.805, appears to be out 
of the known range, at nearly twice that reported for SAR 
values for other microbes. Three possibilities exist for the 
significant difference: (1) The use of revolutionary 
metagenomic sequencing technology, which allows for 
detection of more microbial species and consequently large 
scaling parameter; (2) The human vaginal microbiome has 
higher heterogeneity across individuals, which could be 
validated by future biomedical studies; and (3) The 
postpartum nature of the vaginal microbiome samples 
analyzed in this study. We could not exclude these 
possibilities at present due to insufficient available data for 


comparative research. Indeed, previous studies have 
classified human vaginal microbiomes into five main 
community-state types (CSTs), in which CST |, Il, Ill, and V 


are dominated by Lactobacillus spp., and CST IV is composed 
of facultative or strictly anaerobic bacteria (Gajer et al., 2012; 
Ravel et al., 2011), many of which are BV-related. The vaginal 
microbial communities of postpartum women in rural Malawi 
studied by Doyle et al. (2018) and reanalyzed here were 
mostly Lactobacillus-deficient microbiomes, which could be 
grouped as CST IV, although all these women were healthy. 
Therefore, the classification of CSTs may be more complex 
than initially conceived. Consequently, our DAR analysis 
based on Doyle et al. (2018) may be limited by the datasets of 
postpartum women, and the DAR parameters of the vaginal 
microbiomes of other CST women are likely different from the 
results reported here. Further studies should be performed to 
clarify this important issue. 

The major findings in this study can be summarized using 
four profiles: i. e., DAR profile, characterizing the change 
(scaling) in diversity heterogeneity across individuals; PDO 
profile, characterizing the pair-wise similarity (overlap) 
between individuals; MAD profile, characterizing the maximal 
accrual diversity in a population; and RIP profile, 
characterizing the ratio of individual-level diversity to 
population-level diversity. Theoretically, the four profiles can 
together summarize the essential characteristics of the spatial 
distribution of vaginal microbial diversity and offer effective 
tools to sketch out the biogeographic maps of the human 
vaginal microbiome. Practically, they are essentially 
quantitative metrics of diversity heterogeneity across 
individuals from different dimensions (diversity scaling, pair- 
wise similarity in diversity, maximal accrual diversity, ratio of 
individual to population diversity). These multidimensional 
metrics could provide more comprehensive tools for 
understanding the implications of vaginal microbial diversity to 
women’s health, including the DSR hypothesis for BV etiology 
(Ma et al., 2012, Ma & Ellison 2018, 2019 Sobel, 1999). In 
addition, the quantitative models of the four profiles obtained 
here could be harnessed to assess and predict microbiome 
diversity changes at the population scale and are of potential 
significance for evaluating women’s health associated with 
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vaginal microbiomes. 
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